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More and more, programmers and work- 
station builders are using DESQview 2.0 as a 
development tool. The reason is simple. 
They can create powerful, multitasking 
solutions today for the millions of DOS PCs 
in use today. Solutions comparable to those 
promised for tomorrow by OS/2. 


The API Advantage 
Programmers who take advantage of DESQview’s API 
(Application Program Interface) get access to the powerful 
capabilities built into DESQview—multitasking, window- 
ing, intertask comunications, mailboxes, shared programs, 
memory management, mousing, data transfer, menu- 
building and context sensitive help. 


Bells and Whistles 
A program taking advantage of the DESQview 2.0 API can 
spawn subtasks for performing background operations or 
new processes for loading and running other programs 
concurrently. It can schedule processing after an interval or 
at a certain time. It can use DESQview’s intertask commu- 
nications to rapidly exchange data between programs, 
share common code and data; or interrupt at critical events. 
It can use DESQview’s menuing and mousing capabilities 
to create menus. And there’s lots more it can do. 
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Some of the applications under 
development right now using 
DESQview 2.0 API Tools: CAD, 
Medical systems, insurance, 3270 
mainframe communications, 


network management, real 

estate, typesetting, point of sale, 
education, commodity trading, 
stock trading and online voting. 


80386 Power 


80386 programmers can take advantage of 
the 80386’s protected mode for large 
programs, yet run on DOS and multitask in 
DESQview-side by side with other 80386 
and DOS programs. The breakthroughs that 
make this possible: DOS Extenders from 
PharLap Software and AI Architects and 
DESQview support of these DOS extenders. 


DESQview Developer Conference 


So if you are a developer, looking to create programs with 
mainframe capabilities, but wanting to sell into the existing 
base of millions of DOS PCs, come to Quarterdeck’s first 
DESQview API Developers Conference, August 16-18, 1988 
at the Marina Beach Hotel, in Marina del Rey, California. 
For more information call or write us. 


Come learn about the DESQview 2.0 API and 80386 DOS 
Extenders. Meet 80386 experts as well as those smart 
people who are creating DESQview 2.0 API workstations 
solutions. 


And if you want to get a leg up before the conference, ask 
us about the DESQview API Tools for assembler or C 
programmers. 


New Power to DOS. 
ew 2.0 API Toolkit. 


Quarterdeck Office Systems 150 Pico Blvd.,Santa Monica, CA 90405 
(213) 392 9851 
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work done before 


labels while you're writing a report in Word 
Perfect, or laying out a newsletter in Ventura 
Publisher, or designing a building in AutoCAD. 
DESQview even lets you transfer text, 
numbers, and fields of information between 


The future of personal computing is clear. More 
powerful PCs. Easier to use PCs. With graphics 
and character-based programs working side by 
side. Talking to each other. Multitasking. Win- 
dowing. Menuing. Mousing. Getting your work 


done easier and faster. programs. 
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We've spent years developing our 
file manager so you wont have to. 


You could invest hundreds of programming 
hours writing a file management system for your 
next application. 

Or you could simply invest in Btrieve? 

And get all the file handling functionality you need, 
through subroutine calls from your favorite 
programming language. 

Portable. Write your application once. 
Wherever Btrieve runs, your application will run, 
whether ina single user or multiuser environment. 


® 


In fact, Btrieve is the standard access to NetWare® 


Fast. Written in assembly language, Btrieve 
uses b-tree indexing algorithms with caching 
and automatic balancing for fast, efficient file 
management. 

Safe. Btrieve is the only fault tolerant file 
manager with built-in file recovery. In the event 
of a system or power failure, your database is 
protected. 

Flexible. Develop applications with the 
capabilities you need most. Like 255 open files, 
unlimited records per file, 24 indexes per file, and 


a maximum file size of up to four gigabytes. You 
can access Btrieve from BASIC, C, Pascal, 
COBOL and others. 

Invest in Btrieve. At just $245 for single user 
and $595 for multiuser, it’s a small price to pay for 
all the file manager you'll ever need.* And you'll 
never pay royalties on the applications you devel- 
op. To find out more, see your authorized Novell 
reseller, or call (512) 346-8380. 

For more information, call from your modem 
1-800-444-4472 (8 bit, no parity, 1 stop bit) and 
enter the access code NVBT13. 


For software solutions, 
you should be seeing red. 


*Suggested retail price (US dollars) ©1987 Novell, Inc., World Headquarters, 122 East 1700 South, Provo, Utah 84601 (801) 379-5900 
Requires PC-DOS or MS-DOS 2.X, 3.X or Xenix. 
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Technical Brief: 0S/2 or UNIX? 

How are UNIX and OS/2 the same and where do they differ? This 

technical brief outlines the differences, beginning with the user graphic 

interface right through to porting DOS applications. 
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Optimizing UNIX’s Serial Character I/0 

Serial character input/output is the life’s blood of any multiuser system, so 
enhancing character processing is the key to improving system 
performance. 
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Programming the 8086/8088 To Perform 

When writing programs close to the CPU, the ongoing challenge to the 

computer programmer is to decide when to optimize programs for size and 

when to optimize for speed. In this article, Kevin Parker delves into the 

rules that save time and space when programming for the 8086/8088. 
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PRODUCT REVIEWS 


UNIX on the PC 

SCO’s Xenix, Microport System’s V/386, and Interactive Systems’ 386/ix 

offer three alternatives for running UNIX on the 386-based PC. 
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Running DOS Under UNIX 

Switching to a UNIX operating platform does not mean you have to 
forsake DOS. This article looks at Merge 386 and VP/ix, two packages 
that allow you to run DOS as a task in a UNIX multitasking environment. 
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About the cover: It is the dawn of a 
new age for the PC. The advent of 386 
chip technology now makes it possible 
to harness the power of the mainframe 
or minicomputer on the desktop. In this 
issue we will explore ways to push your 
PC to the edge of the processing enve- 
lope with a look at UNIX alternatives 
for the 386. 
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FROM THE EDITOR’S DESK 


UNIX Challenges OS/2 


ast year Intel shipped an estimated 600,000 80386 microprocessors. 

This year it is expected to ship close to 3 million 386 chips. Most of those 

chips are going into PC-compatibles, and a very significant percentage 
will not be running either DOS or OS/2. Rather, they will use UNIX as their 
operating system. 

UNIX first became available on PCs way back in 1984, soon after the XT was 
introduced. IBM even introduced a version of UNIX for the XT that ran a 
three-user system. However, the XT’s 10-Mbyte hard disk drive, slow speed, 
and small memory made UNIX more of a curiosity (some preferred to call it a 
joke) than a usable product. 

When the AT was introduced, UNIX implementers thought that here, finally, 
was a desktop system that could support a small multiuser UNIX system. 
Several UNIX implementations were introduced. However, the improved horse- 
power of the AT was soon found to be too lacking for anything but the smallest 
multiuser systems. 

The introduction of fast 80386 systems, megabyte-size memories, and hard 
disks capable of storing 100 megabytes and up appear to finally have provided 
the necessary performance and architecture to build low-cost desktop UNIX 
systems that compare favorably with mainframe UNIX implementations. Prices 
for multiuser UNIX systems based on this 386 hardware now compete favorably 
with local area networking systems. Three such UNIX systems are reviewed 
in this issue. 

IBM and Microsoft, the two most powerful marketing entities in the com- 
puter business, are putting their combined forces behind OS/2. Their market- 
ing wherewithal is most formidable. On the other hand, Sun Microsystems 
and AT&T have joined forces and are enlisting the aid of other companies, such 
as Unisys and Motorola, to develop a standard UNIX. This would overcome 
many of the incompatibility problems currently associated with UNIX. 

OS/2 was written for 286-based machines and hence does not take advantage 
of the 386’s architecture; yet OS/2 needs the 386’s performance. On the other 
hand, many UNIX implementations exploit the 386’s features. Add to that the 
ability to manage large memory, graphical user interfaces, and a powerful LAN 
interface, and UNIX presents a formidable competitor to OS/2. Also, there are 
many good UNIX applications already available while the availability of OS/2 
applications has a long way to go to catch up. 

It will be some time before we see a 386 version of OS/2. In the meantime, 
the UNIX vendors have a window of opportunity. And, they appear to be taking 
advantage of it. 

Several UNIX vendors have also introduced a DOS facility for their UNIX 
implementations that allow many standard DOS applications to run in a true 
multiuser/multitasking environment. OS/2, on the other hand, can run stan- 
dard DOS application software, but only in a single-tasking mode called the 
“compatibility box.” OS/2 requires that applications which can be multitasked 
be written specifically for OS/2. On 386 systems, UNIX with DOS tasking 
facilities utilizes the 386’s 8086 real mode, which means standard, unmodified 
DOS tasks can be run in a true multitasking environment. Many users are 
choosing this approach to multitasking standard DOS applications. Two such 
implementations are reviewed here. 

We think you will find this issue of Micro/Systems provides a window into 
things to come, and one of the most interesting issues we have published. We 
found it rewarding to put it together for you and hope you will find it just as 


rewarding to read. 
G/ Like 
“ Sol Libes 


Founder and Editor 


4 Micro/Systems 


For the PC Systems Integrator 


MicrotSystems 


URNAL 


EDITORIAL 


Founder and Editor 
Technical Editors 


Associate Editors 


Contributing Editors 


Managing Editor 


Sol Libes 
Stephen R. Davis 
Don Libes 

Lennie Libes 
Susan Libes 
A.G.W. Cameron 
Patrick Corrigan 
PL. Olympia 
Thomas M. Woolf 


PRODUCTION 


Art & Production 
Director 


Art Director 
Asst. Art Director 
Typographer 


Larry L. Clay 

Joe Sikoryak 
Barbara Mautz 
Lorraine Buckland 


CIRCULATION 


Director of Circulation 
Asst. Circulation Mgr. 
Direct Mktg. Specialist 
Fulfillment Coord. 
Newsstand Mgr. 


Maureen Kaminski 
Andrea Weingart 
Kathleen Shay 
Francesca Martin 
Sarah Frisbie 


ADMINISTRATION 


V.P. Finance & 
Operations 

Business Mgr. 
Accounting Supv. 
Accts. Payable Asst. 
Accts. Receivable Asst. 


Kate Wheat 

Betty Trickett 

Mayda Lopez-Quintana 
Luanne Rocklewitz 
Wendy Ho 


ADVERTISING 
Advertising Director Richard Mixter 
National Account Mgr. Dwight Schwab 
National Account Mgr. Tami Brenton 
Advertising Coord. Shaun Hooper 


M&T Publishing, Inc. 


Chairman of the Board Otmar Weber 
Director C.F. Von Quadt 
President & Publisher Laird Foshay 
V.P. of Publishing William P. Howard 


Micro/Systems Journal (ISSN 8750-9482) is published 
monthly by M&T Publishing, Inc., 501 Galveston Drive, 
Redwood City, CA 94063; (415) 366-3600. Second-class 
postage paid at Redwood City and at additional entry 
points. 

Article Submission: If you have a specific area of 
expertise or interest and would like to contribute, please 
write Micro/Systems Journal, P.O. Box 1192, Mountain- 
side, NJ 07092; (201) 522-9347, or contact M&T Publish- 
ing, Inc., 501 Galveston Drive, Redwood City, CA 94063; 
(415) 366-3600. Please do not submit articles without 
first contacting the editors. 


Correspondence: Please send letters to the editor to 
Micro/Systems Journal, 501 Galveston Drive, Redwood 
City, CA 94063. Other editorial correspondence may 
also be directed to P.O. Box 1192, Mountainside, NJ 
07092. The editors may also be reached via MCI Mail 
(SLIBES or MSJ). 

Advertising Rates: Available upon request. Call (415) 
366-3600 or write to: Advertising Department, Micro/ 
Systems Journal, 501 Galveston Drive, Redwood City, 
CA 94063. 


Aucust 1988 


SUN 


~ DESIGN —£INISD |. 
eee 


27 


MAGIC PC ELIMINATES CODING ... CUTS MONTHS OF DATABASE DEVELOPMENT! 


Time is money. And coding a DBMS 
application like Accounting or Order 
Entry takes a lot of both. Simply be- 
cause hacking out mountains of code 
with your RDBMS or 4GL is too 
slow. Not to mention the time to re- 
write if you make a mistake or change 
the design. 


EXECUTION TABLES 
ELIMINATE CODE! 
Magic PC cuts months of your appli- 
cation development time because it 
eliminates coding. You program with 
the state-of-the-art Execution Tables 
in place of conventional programming. 


HOW DOES IT WORK? 
Magic PC turns your database design 
scheme directly into executable appli- 
cations without any coding. Use Exe- 


cution Tables to describe only what 


your programs do with compact design 
spec’s, free from lengthy how to pro- 
gramming details, Each table entry is 
a powerful non-procedural design in- 
struction which is executed at com- 
piled-like speed by a runtime engine. 
Yet the tables can be modified “on the 
fly” without any maintenance. De- 
velop full-featured multi-user turn- 
key systems with custom screens, 
windows, menus, reports and much 
more in days — not months! No more 
low-level programming, no time 
wasted... 


MAGIC PC 


The Database Language 


atotesin “Magic PC’s database en- 
St gine delivers powerful app- 
- lications in a fraction of 
the time... there is nocom- 
petitive product.” 
“Overall, Magic PC is one 
of the most powerful DBMS 
packages available.” 


® Quick Application Generator 

© BTRIEVE® — based multi-user RDBMS 
® Visual design language eliminates coding 
® Maintenance-free program modifications 
© Easy-to-use Visual Query-By-Example 
®@ Multi-file Zoom window look-ups 

® Low-cost distribution Runtimes 

® OEM versions available 


ATTENTION BTRIEVE® USERS 
Now you can quickly enhance your BTRIEVE®- 
based applications beyond the capabilities of 
XTRIEVE® and RTRIEVE®. Use Magic PC as 
aturn-key BTRIEVE® Application Generator to 
customize your applications without even chang- 
ing your existing code. 


= 
Hie 
19782 MacArthur Boulevard, Suite 305 
Irvine, California 92715 
TLX: 493-1184 FAX: 714-955-0199 


DATABASE PROGRAMMERS 
Join the thousands of professional 
database programmers and vertical 
market developers who switched to 
Magic PC from dBase®, R:BASE®, 
Paradox®, Clipper®, Dataflex®, Rev- 
elation®, Basic, C, Pascal, etc. 


TRY BEFORE YOU PAY 


We're so sure you'll love Magic PC — 
we'll let you try the complete package 
first. Only a limited quantity is avail- 
able, so call us today to reserve your 
copy. Pay for Magic PC only after 30 
days of working with it.* To cancel... 
don’t call... simply return in 30 days 
for a $19.95 restocking fee. 


OR PAY NOW AT NO RISK 
Pay when you order and we'll wave 
the $19.95 restocking fee so you have 
absolutely no risk. 


SPECIAL OFFER S69 


1199.* 


Magic LAN multi-user — $399 
Magic RUN — call for price 


Order Now Call: 
800-345-MAGIC 


In CA 714-250-1718 


MS 
Add $10 P&H, tax in CA. International orders add $30. 
*Secured with credit card or open P.O. Valid in US. 
Dealers welcomed 


CALL ADVERTISER DIRECTLY 


Requires IBM/100% comp., 512K, hard disk. DOS 3.0 or later. Includes BTRIEVE runtime. Not copy protected. 2 5.25-inch disks. All trademarks acknowledord © Conuriaht 1002 & tan Cnn 


News & Views 


by Sol Libes 


Random Rumors & Gossip 
Hewlett-Packard is rumored to be 
readying a new laser printer with an 
expected list price of $1,500. This 
means that the street price should 
be close to $1,000. 

Apple Computer Inc. has an- 
nounced that it will support Micro- 
soft’s LAN Manager running as a cli- 
ent workstation. And there are ru- 
mors that Apple will soon introduce 
an SE+ that supports color and uses 
a 68020. 

AT&T has disclosed that it will ship 
beta UNIX V.4 source code in the first 
quarter of next year, nine months 
ahead of schedule. AT&T will also 
publish its UNIX System V Interface 
Definition (SVID) in the fourth quar- 
ter of 1988, also ahead of schedule. 
These disclosures are no doubt a re- 
sponse to the creation of the Open 
Software Foundation created by its 
UNIX competitors. 

Compaq Computer is expected 
to finally enter the laptop competition 
with high-performance 286- and 386- 
based systems. Prototypes that have 
been shown to select customers weigh 
about 12 pounds and include 40- 
Mbyte, low-profile hard disks, recharge- 
able batteries, 1.44-Mbyte, 3'/2-inch 
floppies, 1-Mbyte of RAM and double 
supertwist LCD displays. 

IBM has disclosed that it “plans 
to deliver a PS/2 capable of parallel 
processing before the year is out.” 
In all likelihood, this will be a smart 
disk drive controller using the Micro- 
Channel’s multi-master feature— 
IBM’s first product to use this much- 
publicized feature. 


286 & 386 Clock Rates Jump 
Most of the new AT-compatible ma- 
chines now feature 20-MHz, 286 proc- 
essors. Some vendors are boasting 
that these machines outperform many 
386-based machines when coupled with 
disk caching and full-track buffering. 
Most vendors are using 16-MHz, 
286 chips from Harris and AMD and 
are screening them for reliable per- 
formance at 20 MHz. Intel’s fastest 
286 chip is still only rated at 12.5 
MHz. Both Harris and AMD are ex- 
pected to soon start shipping 286 chips 
rated at 20 MHz. This probably means 
that system vendors will soon offer 
AT compatibles running at 25 MHz. 
Intel is now shipping small quanti- 
ties of 386 chips rated at 25 MHz to 
selected customers (mostly their own 
personal computer group). Shipments 
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to OEMs are expected to begin by 
year-end so that next year 25 MHz 
should become the standard rate for 
386-based systems. 

Intel has disclosed that it will de- 
liver initial samples of an 80386 proc- 
essor chip rated to run at 33 MHz to 
a select group of OEMs late this year. 
Intel is also expected to ship sample 
386 motherboards using the chip be- 
fore year-end. This probably means 
that we will see initial production ma- 
chines running at 33 MHz by the 
middle of next year and that in 1990 
this should become the de facto speed 
for 386 systems. 

The standard 386 system sold next 
year should come with 4 Mbytes of 
memory on the motherboard (up- 
gradeable to 16 Mbytes), include a 
40-Mbyte hard disk, and have a street 
price of around $2,500. 

Prior to the Intel disclosure, Mo- 
torola had announced that it would 
provide 33-MHz, 68030 samples in 
August and production parts by year- 
end. These parts will most likely be 
used by the UNIX system vendors, 
such as Sun Microsystems, as early 
as the first quarter of ’89, while Apple 
will probably wait until late in the 
year to announce high-speed versions 
of the Mac II. 


PS/2 Clone Update 

Compaq and Zenith Data Systems 
have both publicly stated that they 
do not intend to introduce PS/2 clones 
employing IBM’s patented MicroChan- 
nel Architecture. Both cited the MCA’s 
poor performance as the reason. An- 
drew Czernek, a Zenith vice presi- 
dent, went so far as to say, “The MCA 
is an inferior product with inferior 
performance; We are not going to be 
swayed by the hype.” 

And Mike Swavely, a Compaq vice 
president said, “The PS/2 has been 
unable to deliver any significant ad- 
vantage over what the standard PCs 
deliver.” He went on to say that “the 
market hype that IBM put out about 
multiprocessors is really inaccurate. 
Every interrupt from any I/O device 
of any type gets processed with every 
memory cycle. There is not enough 
[bus] bandwidth to support [multiple 
processors].” 

Compaq has acknowledged that it 
has MCA technology under develop- 
ment and that it is negotiating PS/2 
patent licensing with IBM. There is 
no doubt that they will be watching 
the market’s reaction to the Tandy 


and Dell Computer PS/2 compatibles 
closely, and if there is sufficient de- 
mand, they will also introduce such 
machines. 

It is apparent that the old AT bus 
architecture still has sufficient band- 
width to match that of the current 
PS/2 machines. The dual-bus expan- 
sion for 32-bit memory adopted by 
Compaq and others for 386-based ma- 
chines has enabled manufacturers to 
improve AT-bus machines perform- 
ance beyond that of IBM’s PS/2 ma- 
chines. AST has already introduced a 
multiprocessor extension to the AT 
bus that may be adopted by other 
AT-bus machine manufacturers. 


The UNIX Standard Battle 
AT&T may have created UNIX, but 
they lost control over it when they 
licensed it to companies such as IBM 
and Microsoft who, in turn, relicense 
dit to distributors and resellers. Added 
to that are the enhanced versions 
from university labs. The result is 
creating problems for software devel- 
opers, UNIX system vendors, and end 
users who need software portability. 
Binary file portability has proven to 
be a major problem and source code 
portability has created a thriving busi- 
ness for VARs and consultants. 

UNIX vendors have long argued for 
a tightly controlled UNIX standard 
such as exists for DOS, OS/2, and 
Apple’s Macintosh operating systems. 
Recently Sun Microsystems and AT&T 
entered into an agreement to develop 
anew “standard” version of UNIX and 
a “standard” graphical user interface 
called “Open Look.” More than 50 
UNIX system vendors, including Xerox 
and Unisys, agreed to support the stan- 
dard. And software developers such 
as Lotus and Ashton-Tate have agreed 
to port applications to Open Look. 

However, several key UNIX ven- 
dors are quite upset with the AT&T/ 
Sun effort and have launched their 
own “Open Software Foundation” to 
develop another “standard” UNIX and 
user interface. The foundation includes 
such key UNIX players as IBM, DEC, 
Hewlett-Packard, and Apollo. It is un- 
usual for these heavyweights to coop- 
erate on anything, which indicates 
that they are seriously threatened by 
the AT&T/Sun effort. IBM and DEC 
want to keep their customers locked 
into their own operating systems and 
hence certainly do not want a UNIX 
standard that would allow users to 
buy systems on price and perform- 
ance. HP and Apollo are firmly com- 
mitted to UNIX and are fearful that 
AT&T has given Sun an inside track. 

One thing is for sure—UNIX growth 
will continue to be hampered by the 
problem of standards. oO 
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THE C FORUM 
by Don Libes 


Speeding Up strcpy 


e use strcpy all the time and 

take it for granted. We as- 

sume that it not only works, 
but that it is optimally fast. The C 
idiom for string copying is stated so 
simply, it is hard to see at first how 
it could possibly be improved: 


while (*dest++=*src++) ; 


Wrapping this phrase with formal pa- 
rameters, and arranging for the re- 
turn value gives us a complete strcpy. 


char *strcpy (dest,src) 
char *dest, *src; 
{ 


char *s = dest; 
while (*dest++ =*srct++) ; 


return (s); 


Surprisingly, this is not necessarily 
the fastest implementation of strcpy 
for most machines and most compil- 
ers. For example, some machines have 
astrcpy-like machine instruction. This 
obviously makes things much easier. 
For now, however, I will discuss the 
situation most of us face. 

A dramatic improvement can be 
made by having the compiler put the 
while loop variables in registers. This 
can be done by modifying the pa- 
rameter declaration so that it reads: 


strcpy (dest,src) 
register char *dest, *src; 


Many compilers pass arguments 
on the stack and cannot obey this 
request directly. However, some com- 
pilers simulate it as if the code was 
written as follows: 


strcpy (dest2,src2) 


Don Libes is a computer scientist based 
in the Washington, D.C., area working 
in artificial intelligence and robot con- 
trol systems. 
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char *dest2, *src2; 

{ 
register char *dest = dest2; 
register char *srce =src2; 


Other compilers are not so smart. 
It may seem obvious how to optimize 
when looking at a simple subroutine 
like strcpy, but subroutines with many 
variables cannot be optimally com- 
piled. Thus, it is generally not worth 
using register declarations on formal 
parameters unless you are coding for 
a particular compiler. 

Notice that strcpy returns its first 
argument. It has never been clear to 
me why this is, since returning the 
end of the string is much more useful. 
For example, if you are going to ap- 
pend existing strings to a string you 
have just created, you will need to 
know where to start from. The only 
way to get this information is with 
strlen. strlen has to walk across every 
character in the string, which is ex- 
actly what strcpy does. What is the 
point of returning something that we 
obviously had to know to call the 
function in the first place? It is easy 
to fix strcpy. Let’s call it strepye. 


/* strepy, but returnendof dest */ 
char *strcpye (dest,src) 

char *dest, *src; 

{ 


while (*dest++=*srct+t) ; 


return (dest-1); 


Having a pointer to the end of a 
string is very useful. For example, 
we can calculate the length of the 
string by subtracting the beginning 
from the end. We can also reverse 
this operation assuming we have the 
length and want to calculate the end 
of the string. Sometimes we may al- 
ready know the length of a string 
without having to call strlen or an- 
other string routine. In that case, we 
can do string copying even faster. 

Many machines have single ma- 
chine instructions that can copy a 


given number of bytes. These are 
available through the C library func- 
tion called memcpy. bcopy is another 
name used by some systems, but it 
does the same thing as memcpy. 
memcpy is the name given this func- 
tion by the ANSI C standard. 

memcpy is faster than strcpy, so if 
you already have the length or end 
of a string, you should call memcpy 
instead of strcpy. Unfortunately, both 
memcpy and strcpy handle overlap- 
ping strings in implementation- 
defined ways. This is noted by the 
ANSI C standard. 

Using the strcpy we defined earlier, 
the result of the following code leaves 
s with the string “model.” 


char s[80] ="/model"; 


/* strip leading slash */ 
strcpy(s,s+1); 


Attempting to add space into the string 
fails: 


strcepy(st+1,s); 


In particular, s will be filled with 
slashes, and strcpy will attempt to fill 
all remaining memory with slashes. 
The problem is that strcpy copies each 
character over the next one that it is 
about to use. It will never see the 
terminating null because it will have 
already written over it with a slash. 

Reversing strcpy so that it copies 
rear to front fixes this problem but 
then breaks strcpy(s,s+ 1). Reverse copy- 
ing is also slower since we have to 
find the end of the string. One solu- 
tion is to compare the pointers to 
each other. If the source is greater 
than the destination, strcpy runs from 
the front of the string to the rear, 
otherwise it runs from the rear to the 
front. Let’s call this strcpyo. 


/* strcpy that handles overlap */ 
char *strcpyo (dest,src) 
char *dest, *src; 
{ 
return (src>dest? 
strcepy forward (dest,src): 
strcpy_backward(dest,src) ); 


A lot of string processing is done 
by ad hoc code because the string 
routines in the libraries don’t quite 
match the programmer’s require- 
ments. My comment about returning 
string ends rather than string begin- 
nings is one such example. Another 
is requiring strcpy to behave a par- 
ticular way when dealing with over- 
lapping strings. This is an unfortu- 
nate fact of life spelled out by the ANSI 
C standard. 
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UNIX'meets DOS. And a powerful solution is born. 


Until now, UNIX users coveted the wealth of application 
software available on DOS. While DOS users envied the power 
of their UNIX counterparts. 

Now Locus Computing gives every user the best of both 
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PC-Interlace. Merge 386 and Xsight are trademarks of Locus Computing Corporation. UNIX is a registere: 
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centralized files stored on a UNIX host. Or work in a UNIX 
environment from their PC workstation. It’s their choice. 

Xsight™ windowing software gives PC workstations and host 
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FAX: (213) 450-1432. And marvel 
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Making sarcpy Fast on a RISC 
Complex machine architectures dem- 
onstrate the problems of producing 
optimal code. For example, the VAX 
has a single instruction that can copy 
blocks of arbitrary length, and an- 
other that can locate the ends of null- 
terminated strings. Using these, it 
becomes very easy to write an effi- 
cient strcpy (as well as strlen, strcat, 
and others). The first instruction finds 
the end of the string, and the second 
uses it to do the copy. That’s two 
machine instructions! 

In the case of the VAX, this worked 
fine. However, when a new machine 
was introduced, the MicroVAx II, it 
turned out that the engineers couldn’t 
fit all the old VAX microcode into the 
microcode space of the MicroVAX. 
The obvious solution was to emulate 
the instructions in software. This 
worked fine except that strcpy now 
executed much more slowly than the 
original test-and-branch loop! The rea- 
son was that the optimized (two- 
instruction) version traversed the 
source string twice, while the unop- 
timized (C while loop) version trav- 
ersed it only once. 

This demonstrates a general prob- 
lem of complex architectures, often 
referred to as CISC (for Complex In- 
struction Set Computers); they cause 
problems for compiler writers trying 
to create optimal compilers and li- 
braries. Implementors generally have 
to learn large sets of complex instruc- 
tions and gain a much deeper level 
of understanding about how they exe- 
cute. For example, instruction tim- 
ings vary, depending upon context, 
because of caching and pipelines. 


While some of these factors affect 
RISC machines as well, all RISC in- 
structions take one cycle (well, al- 
mostall do), which makes instruction 
choices much simpler. The result is 
that the time you spend optimizing a 
compiler for a RISC machine will be 
less than for a CISC machine. 

When the next version of your RISC 
machine appears, you won’t have to 
spend time rewriting the optimizer 
to account for new instruction tim- 
ings. The VAX—clearly a CISC ma- 
chine—presentsexactly this problem 
because there are so many models. 
The solution is to make the code 
straightforward, and avoid the 
specialized instructions that are slow 
or aren’t implemented on some 
machines. 

By analogy, you can use the same 
reasoning for per-compiler optimiza- 
tion. Since your C code may run on 
many different compilers, operating 
systems, and machines, it is often 
better to write straightforward code 
that maximizes portability. Super- 
optimizing code is difficult. Are you 
sure it is better to code it this way 
on an 8088/8086? 80286? 80386? 
68000? VAX? SPARC? Cray? Let the 
compiler do the optimization. 


Loop Unrolling 

People familiar with compiler writing 
have doubtless seen the following 
trick. The C implementation was first 
attributed to Tom Duff of Bell Lab- 
oratories and is known as “Duff’s 
Device”: 


switch (n&7) { 
do { 


«Gives you all the right stuff 

for debugging! No matter which 
model you pick, you have the 
same powerful software to help 
you track down hard-to-find bugs 
fast, 
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*dest++ = *srctt; 


case 7: *destt+ = *srctt+; 
case 6: *dest++ = *srct+t; 
case 5: *dest++ = *srctt+; 
case 4: *dest++=*srctt; 
case3: *destt+t+=*srctt+; 
case 2: *dest++ = *srct+t+; 
case l: *dest++ = *srct++; 
case 0: ; 


} while (0 <=n-=8); 


My compiler said, “warning: loop 
not entered through top,” but com- 
piled it correctly. Aside from 
demonstrating the bizarreness al- 
lowable with switches, this piece of 
code forces an optimization known 
as loop unrolling. The idea is that 
code you expect to loop many times 
can be sped up by not checking the 
termination condition each time 
through the loop. In this implementa- 
tion, we check once every eight as- 
signments. We handle the remainders 
separately. 

For example, if » is 10, the switch 
branches to case 2. Case 2 falls into 
case 1, thereby executing two assign- 
ments (notice the missing breaks), 
and the while returns us to the top of 
the loop to execute the remaining 
eight assignments with no intervening 
tests. 

This would make the heart of a 
very fast memcpy. Notice that it is not 
dependent upon the data types. You 
can copy by char, ints, longs, or even 
structs—whatever you decide. How- 
ever, there are obvious trade offs you 
will have to consider. For example, 
while larger types can store as quickly 
as smaller types, larger types often 
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require alignment or you pay a speed 
penalty. Compilers can do loop un- 
rolling automatically, but only if they 
are assured of a payoff. This is diffi- 
cult in C programs, because so many 
miscellaneous functions call memcpy. 
You might like to create a second 
version of memcpy that has the opti- 
mization in it. Depending on the situ- 
ation, you will want to call the copy 
or the original. 


A Real Fast sircpy 

Why did I bring up loop unrolling? 
Partly to demonstrate it, but also to 
improve strcpy. strepy can use loop 
unrolling, although it is a little trick- 
ier than memcpy. The actual assign- 
ment is easy; the hard part is the 
termination condition. 

Suppose we are copying four bytes 
at a time. We want to stop if any of 
the four bytes are null. The obvious 
test is: 


(*x==0) [1 (*(x+1) == 0) TI 
(* (x+2) ==0) |] (* (x+3) == 0) 


Unfortunately, implementation of 
this expression is more expensive than 
the simple test we were hoping to 
avoid. Fortunately, there is a way to 
detect null bytes in a multibyte con- 
stant with only a few machine instruc- 
tions. The following macro does the 
job: 


#define has_null_byte(x) \ 
((x - 0x01010101) &~(x) &\ 
0x80808080) 


This particular version is written 
for four-byte unsigneds on a 2’s com- 
plement machine. The basic idea is 
that the subtraction propagates bits 
up to the most significant bit in each 
byte by successive carries. If you want 
to really understand this, try running 
an example by hand and watching 
how the bits move. 

Here is a straightforward rewrite 
of strcpy using this macro: 


#define has_null_byte(x) \ 
((x-magicl) &~(x) & magic2) 


char *real_fast_strcpy (dest, src) 
char *dest, *src; 
{ 
register long *d= (long *) dest; 
register long *s = (long *) src; 
register long magicl = 
0x01010101; 
register long magic2 = 
0x80808080; 
register char *dc, *sc; 


/* copy four bytes at atime */ 
while (!has_null_byte(*s)) { 
*d++=*st+; 


} 


/* prepare to copy remaining 
chars */ 

dc = (char *)d; 

sc = (char *)s; 

while (*dc++=*sct+) ; 


return (dest) ; 


real_fast_strcpy copies four bytes 
at a time, once each block is deter- 
mined to be null free. If there is a 
null, it just reverts back to the old 
behavior. Since this only happens close 
to the end of a string, there isn’t 
much penalty for this. 

By declaring d and s as longs (4 
bytes on my machine) I am able to 
have the compiler provide the fastest 
way of copying. This will turn out to 
be a single instruction on most ma- 
chines. 

Notice all the register declarations, 
and how they are ordered. s and d are 
going to be accessed the most fre- 
quently so they come first. I modified 
the macro so the magic numbers are 
stored in registers. Only if the com- 
piler has any registers left over, will 
the old-style strcpy get the benefit of 
using them. 

As I remarked earlier, heavily opti- 
mized code like this starts to sacrifice 
portability. In this case, I assume I 
know how longs are stored, assigned 
and mathematically manipulated. I also 
hope that I’m going to get several 
registers for this to pay off. The worst 
aspect of all is that I have to guarantee 
that none of my strings are butted 
up against the end of my address 
space, and if one is that it is aligned 
on a longword boundary. While such 
an occurrence is extremely unlikely, 
the algorithm will cause an address 
fault when has_null_byte attempts to 
look at several nonexistent bytes at 
the end of the string. 


sprintf 

One final note. While sprintf may seem 
like overkill for a lot of simple string 
manipulations, it is defined to return 
the total number of bytes written (ac- 
cording to ANSI C, others may vary). 
Adding this to the beginning of the 
string gives you the end of the string. 
This avoids exactly the problem I was 
complaining about earlier with strcpy 
and friends. If you are writing code 
that doesn’t need to be as fast as 
possible, don’t bother with the low- 
level functions—usesprinif for every- 
thing! oO 


Did you find this article particularly useful? 
Circle number 1 on the reader service card. 
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Essential C Utility Library (400 useful Cfunctions) . 2... ee ee $160 
Essential Communications Library (C functions for RS-232-based communication BYBLEMIS) sso sere ee GAS as ecm Gg RT & $160 
Greenleaf Communications Library (interrupt mode, modem control, MON SXOEB): 2.55 oF Bi a oe es oi wis we SE $150 
Greenleaf Functions (296 useful C functions, all DOS services)... 2 2. 2 ee ee kk $150 
OS/88 (U +«x-like operating system, many tools, cross-development from MS-DOS) ...................0. $150 
ME Version 2.0 (programmer’s editor with C-like macro language by Magma Software; Version 1.31 still SIS) fo 2x: an Soe $140 
Turbo G Graphics Library (all popular adapters, hidden lineremoval) . . 2... ee $135 
PC Curses Package (full Berkeley 4.3, menu and dataentryexamples) ... 2... 2... ee ee ee ee ee $120 
CBTree (B+tree ISAM driver, multiple variable-length keys) 2... 12. ee ee $115 
Minix Operating System (U*sx-like operating system, includes manual)... 2... 2 ee ee ee $105 
PC/IP (CMU/MIT TCP/IP implementation forPCs) . . 1... $100 
B-Tree Library & ISAM Driver (file system utilities by Softfocus) . . . 6. 2 ee $100 
‘The-Profiler (programiexecution profile tool)! 006 ue ak we Fw | BEDE aa TS Cas HSS $100 
Entelekon C Function Library (screen, graphics, keyboard, string, printer,etc.) . 2... 0. ee ee $100 
Entelekon Power Windows (menus, overlays, messages, alarms, file handling,etc.). 2... 2... ee $100 
TurboGeometry (library of routines for computational geometry)... 6. $90 
QC88 C compiler (ASM output, small model, no longs, floats or bit fields, 80+ functionlibrary) . . 2... 2... $90 
Wendin Operating System Construction Kit or PCNX, PCVMS O/SShells . 2 2 2 2. 2 ee ee ee $80 
C Windows Toolkit (pop-up, pull-down, spreadsheet, CGA/EGA/Hercules) 2. 2 1. 1 ee ee $80 
Professional C Windows (windows and keyboard functions) . . 2 6 6 1 ee ee $80 
JATE Async Terminal Emulator (includes file transfer and menu subsystem) . . 2... 2. ee ee ee $80 
MultiDOS Plus (DOS-based multitasking, intertask messaging, semaphores) . . . 2... 2. eee ee ee $80 
WKS Library (C program interface to Lotus 1-2-3 program & files) 2. 6 6. 6 ee $80 
Professional C Windows (lean & mean window and keyboard handler). . . 2... 2... ee ee ee $70 
Ip (flexible printer driver, most popular printers supported) . 2. 6. we ee $65 
Quihty:(interactive:C: interpreter) <4. sor 6.x sei ce cor Goa) ae Sa A) SE ee Oe BOK BUSI BOS NOG OR Go Gh EA SRS $60 
EZ_ASM (assembly language macros bridging Cand MASM) . . ©. 6 ee ee ee ee $60 
Piree (parse;tree management) 4. 245.6 444.34 14 EERE ESA Hine eGR ee Dee ee ee we $60 
HEEP!(pop-up:help system builder) 2c ig se 8S ERR AH SRA TSE EH AME BOE CBRE Se Zee as $50 
Multi-User BBS (chat, mail, menus, sysop displays; uses Galacticomm modem card)... 6... ee ee $50 
Make}(macros, all languages, Duilt-iiiriles)) se. exes, as on ow ee BW FAS OEE De aa HE $50 
Vector-to-Raster Conversion (stroke letters & Tektronix 4010 codesto bitmaps). . . 2... 6 eee ee ee ee $50 
Coder’s Prolog (inference engine for usewithC programs) .. . 2. 1 2 ee ee ee ee $45 
Virtual Memory System (least recently used swapping) «2 6 6 6s ee ee ee we eee eee es $40 
C-Notes (pop-up help for C programmers ... add yourown notes). . ©. 6 6 1 ee ee $40 
Biggerstaff’s System Tools (multi-tasking window managerkit) . . - 2. - ee ee ee ee $40 
PC-XINU (Comer's XINU operating systemfor PC) 2s 2G GOES SOMERS Pets Ba PEE AS See Se Dw $35 
CLIPS ‘(rule-based expert system generator, Version 4.1) 2 66 6 se 6 HR DOR Dw LR OG $35 
‘Tiny: Curses (Berkeley curses package)! 675 ek ee ee we Ps a A |] 4 GIG Se we eee $35 
TELE Kernel or TELE Windows (Ken Berry's multi-tasking kernel & window package) . . 2... 2...) eee ee ee $30 
Clisp (Lisp interpreter with extensive internals documentation)... . 6. 6 6 6 ee ee ee $30 
Translate Rules to C (YACC-like function generator for rule-based systems) . 2 2. 1 6 we ee ee ee ee $30 
6-Pack of Editors (six public domain editors for use, study & hacking)... 2. 1 ee ee ee $30 
Crunch Pack (a dozen file compression & expansion programs)... - 6 ee ee ee $30 
ICON (string and list:processing language, Version 7) «5 6 ee eS ee EE Ee ES $25 
FLEX (fast lexical'analyzer generator; new, improved LEX) «6 is we HS SE SHS THT Hews we OS $25 
LEX (lexical analyzer generator, an oldie buta goodie) . © 6 1 6 ee $25 
Bison & PREP (YACC workalike parser generator & attribute grammar preprocessor). . . 6 6 6 6 ee ee ee $25 
AutoTrace (program tracerand memory trashercatcher) . 2 6 666 eee ee ee ee ee ees $25 
C Compiler Torture Test (checks a C compiler against K&R) 2.5 66 ee $20 
Benchmark Package (C compiler, PC hardware, and Unixsystem) . . - - - eee ee ee ee ee $20 
TN3270 (remote login to IBM VM/CMS as a 3270 terminal on a 3274 controller) . 2. - 2 6 ee ee $20 
A68 (68000 cross-assembler), 5 g*c sak 6H RAF TORO Cee TE GOS e ws Oe SEEDED ee BE sR $20 
List-Pac (C functions for lists, stacks, and queues) 2 5 6. 2 ee ee ee ee $20 
XLT Macro Processor (general purpose text translator) © 6 6 6 ee ee $20 
C/reativity (Eliza-based notetaker) ©. 6 6 eee ee ee ee ee ee ew ee ee 315 
Data 
WordCruncher (text retrieval & document analysis program) . 6 6 6 6 ee ee $275 
DNA Sequences (GenBank 52.0 including fast similarity search program)... 6 1 ee ee ee $150 
Protein Sequences (5,415 sequences, 1,302,966 residuals, with similarity search program) . . 2. 6 ee ee ee ee ee $60 
Webster’s Second Dictionary (234,932 words) . 2. 6 eee et tee ee ee eee ee ee ee ee $60 
U. S. Cities (names & longitude/latitude of 32,000 U.S. cities and 6,000 state boundary points). . 2. 6 - ) ee ee ee ees $35 
The World Digitized (100,000 longitude/latitude of world country boundaries)... 1 ee ee ee es $30 
KST Fonts (13,200 characters in 139 mixed fonts: specify TEX or bitmapformat) . . . . / . 1 ee ee es $30 
USNO Floppy Almanac (high-precision moon, sun, planet & star positions) . 2... 6 6 ee ee ee ee $20 
NBS Hershey Fonts (1,377 stroke charactersin14 fonts)... 6 1 1 ee ee ee $15 
U.S; Map (15,701 points'of'state boundaries) 22 6355 6 KH EREDE EWE PGES KEE Oe He mE $15 
The Austin Code Works Voice: (512) 258-0785 
11100 Leafwood Lane acwlinfo@uunet.uu.net BBS: (512) 258-8831 
Austin, Texas 78750-3409 USA FidoNet: 1:382/12 


Free shipping on prepaid orders For delivery in Texas add 7% MasterCard/VISA 


by Phil Hughes 


THE UNIX FILE | 


Connecting UNIX Tools 


to Create 


A Program Solution 


to The UNIX File, a quick intro- 

duction is in order. I am one 
of the founders of Specialized Sys- 
tems Consultants (SSC), a computer 
consulting and training company. I 
started working with UNIX in 1980 
when I wrote a requirements state- 
ment for a computer system to be 
used for software cross-development. 
This resulted in the acquisition of a 
UNIx-based system. I installed the 
system and then trained the company 
employees in the use of that system 
and the C programming language. 
Since that time I have been involved 
in similar activities for other compa- 
nies. 

In 1983, SSC decided to specialize 
in UNIX systems exclusively and we 
acquired a second UNIX system—an 
IBM XT-class machine running Venix, 
a version of UNIX developed by Ven- 
turCom. Since that time we have ac- 
quired two AT-class machines, one 
running SCO Xenix 286 and the other 
running Microport V/AT, as well as 
a 386-based machine running SCO 
Xenix 386. Today, all computing needs 
at SSC are handled by UNIX-based 
systems, with the exception of graph- 
ics generation which is done on an 
Atari ST. 

At SSC we address four product 
areas: consulting, training, publish- 
ing, and software. Software develop- 
mentand distribution are all performed 
on UNIX systems using the standard 
UNIX tools. We even have a dial-in 
demonstration system that was devel- 
oped using UNIX shell scripts. Train- 


S ince this is my first contribution 


Phil Hughes is vice president of Spe- 
cialized Systems Consultants Inc. (SSC). 
Located in Seattle, Washington. SSC 
offers pocket reference booklets for vart- 
ous UNIX systems and utilities, and 
software, and offers training and con- 
sulting to UNIX users. 
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ing classes are hands-on using a 386- 
based system. 

In-house, the most interesting use 
of the UNIX system is in our produc- 
tion of publications. We produce all 
of our own catalogs, training note- 
books, and a series of Pocket Refer- 
ence booklets on C and UNK. All of 
these products are created using the 
UNIXtextprocessingsystem.We have 
relied heavily on the UNIX tools to 
make this an easy and efficient 
process. 

Most of our computer needs have 
been satisfied with programs devel- 
oped in-house and the use of shell 
scripts and UNIX utilities. The excep- 


Listing 1. The script file 


$ cat smry 


3 smry —- produce phone bill summary 


tions are a database system (Progress) 
to handle our customer lists and a filter 
that converts trof output to PostScript 
(devps) so we can proof our publica- 
tions on a laser printer and then set 
type on a Linotronic typesetter. 

Now that you know about my UNIX 
background and experience, let me 
address what you will be seeing in 
this column in the future. There will 
be a mixture of tips and techniques, 
things I have learned from my work 
with UNIX systems over the past eight 
years, and a discussion of new UNIX- 
related products. Emphasis will be 
on products that are useful for com- 
puters based on the PC and AT bus. 
I also encourage you to write with 
questions or suggestions for future 
columns. 


Solving Problems 

With UNIX Tools 

This month I want to talk about why 
UNIX should be considered a tool kit 
for solving problems. Most of us with 
a technical background think of a 
programming language such as C as 
the right way to solve many program- 
ming problems. Those with a less 
technical background are more likely 
to think in terms of using a BASIC 
program or possibly a database man- 
agement system. With most operat- 
ing systems, DOS being a good exam- 
ple, this is probably the reasonable 
way to approach a problem. 


# sort phone bill in phone number order 

sort -o phone.bill -t’|’ +0 -1 phone.bill 

# sort phone list in phone number order 

sort -o phone.list -t’|’ +1 -2 phone. list 

# output all lines from phone.bil]l with matching records from phone.list 
# then produce summary and totals by using awk 

join -a2 -t’|’ -jl 2 phone.list phone.bill | awk -F’|’ ’ 


BEGIN{ 

print "phone bill summary\n" 
print "Name or Company 
number = "dummy" 

} 

{ 

if ($1 != number) 

{ 

if (number != "dummy") 


printf "$%-30.30s %-15.15s %6.2f\n", \ 


name, number, total 
grand_total += total 
total = 0 
number = $1 
if (NF = 3) 
name = $2 
else 

name = " 

} 

total += $NF 
} 


22227 


END{ 

if (total > 0) 

{ 

grand_total += total 


printf "$-30,.30s %-15.15s %6.2f\n", \ 


name, number, total 


} 


print "\ntotal bill is $" grand total 
} 


§ 


number 
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Insist 

On The Best 
Micronics 
Motherboards 


uality, 
Performance 
and Innovation 
best describe our 80386 
based board level product 
line. Now with both AT and Baby size and 
high speed CACHE memory. 
Micronics is the leading supplier to OEMs, 


VARs and Systems Integrators that require aa gs 
the best in 80386 technology. Performance 


aes : ae near you, call MI CR@NI C S 


COMPUTERS INC. 
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CIRCLE 113 ON READER SERVICE CARD. 


Get a window on the future... 


without giving up the present 


At last, you can have the benefits of UNIX™ V.3 and high resolution graphics 
through X-Windows—without sacrificing the investment you’ve made in your DOS 


based 80386™ system. 


The Easy Way to X-Windows 


With the XSUBSYSTEM™ from Interfirm Graphic Systems, 
you'll have everything you need to convert your 80386-based 
machine into a powerful UNIX-based graphics workstation. 
Suddenly, you'll have all of the advantages of a multi-user, 
multi-tasking system—plus the incredible convenience of 
DOS programs runnng side-by-side with UNIX. 


The XSUBSYSTEM Includes: 


* 19” High resolution color or monochrome display 
* Graphics controller 

* UNIX System V.3 from Interactive Systems Corp. 

¢ The X Window System software 

* Optional hard disk with all software pre-installed 
* Optional TCP/IP Ethernet controller and software 


Ul) 8: 


GRAPHIC SYSTEMS 


3940 Freedom Circle 
Santa Clara, CA 95054 
(408) 970-7070 
800-3384 513 


The Future of 386 Based Graphics 


The XSUBSYSTEM gives you the best of both worlds - you get 
to keep both your hardware and your DOS software while 
you upgrade to a UNIX V.3/X-Windows System. 


For example, with Interfirm’s XSUBSYSTEM, you can take 
data created in a DOS application and paste it into a 
document that you are creating with the Frame Maker™ 
desktop publishing system. 


It’s the most sensible way we know to give your 386 based 
system the power of a true graphics workstation - at a sliver 
of the price. 


At Interfirm Graphic Systems, we are pioneering UNIX-based 
high-performance graphics, utilizing the X Window System 
and other emerging non-proprietary standards. When you 
look to the future of desktop computing, look to Interfirm. 


Trademarks: UNIX-Bell Laboratories; XSUBSYSTEM - Interfirm Graphic Systems; 
MS DOS - Microsoft; X Window System - M.LT.; 80386-Intel Corp; Frame Maker- 
Frame Technology Corp. 
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With the UNIX system there is such 
arich supply of tools. Many times you 
can connect a few of these tools to- 
gether to produce the desired result. 
Even if this approach isn’t efficient 
in terms of computer resources, it 
may offer a quick way to prototype a 
solution. Once it is running, programs 
can be written to handle the time- 
consuming parts in a more efficient 
manner. 

To help you look at possible uses 
for these tools, I offer the following 
problem. Every month you get a tele- 
phone bill and then stare at it for a 
while attempting to figure out the 
long distance charges. You look up a 
few numbers, remember a few, and 
finally accept the total as being close 
enough and pay the bill. Many of us 
have some sort of phone number list 
on a computer system. If we just had 
away to extract the name or company 
information by phone number and 
match it up with the bill, it would be 
easy to enter numbers and amounts 
and produce a report showing who 
was called and what it cost. 

The production of this kind of re- 
port isa mundane programming task. 
Print out a summary line for each 
phone number with the amount and 
print a grand total. This could be 
easily written in any programming 
language. The hard part of the project 
is how to match up the phone num- 
bers in the bill with those in the name/ 
phone number data file, and how to 
handle the numbers that don’t match— 
the numbers that are on the bill but 
not in the name/number data file. 

Join, a UNIX program that joins two 
tables, to the rescue. Join will match 
two files and print-selected fields from 
each file. It will also optionally print 
out the lines in one file that have no 
match in the other file. The only input 
requirements are that the files be 
sorted by the fields to be used for the 
match and that there be some sort 
of field separator character. 

Sorting can be handled by the sort 


Example1. The two data files 


$ cat phone,list 

Micro/Systems Journal|201-522-9347 
UNIX User |213-372-9917 

Office of the Americas|213-451-9761 
tecNICA] 415-848-0292 

Mike Lowry| 442-7170 

Met ro|447-4800 

Phil Hughes |527-3385 

Freedom Fund|547-7644 


$ cat phone.bill 
201-522-9347/1.68 
201-522-9347|4,15 
213-372-9917|3.71 
213-555-1212| .75 
415-848-0292|2.17 


$ 
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program, another standard UNIX util- 
ity. Just to keep things in the UNIX 
utility ballpark, I chose to use awk 
as a report generation program to 
produce the output. Listing 1 shows 
the script that invokes sort, join, and 
awk, Example 1 shows the two data 
files, PHONE.BILL and PHONELLIST. 
Finally, Example 2 shows the output 
from running the script. 

In this discussion, I want to con- 
centrate on the sort and join com- 
mands. Awk was discussed by Ian 
Darwin in “The UNIX File” column 
(Micro/Systems Journal, July/August 
1987). I will discuss awk further in a 
future column. 

First, the two sort lines. The —o 
option tells sort that the next argu- 
ment is the name of the output file. 
By using this option it is possible to 
sort a file “in place.” In other words, 
the sorted data ends up back in the 
original file. The -¢ option is used to 
specify the field separator character. 
If you look in Example 1, you will see 
that I used a | as the field separator. 
It was necessary to quote the charac- 
ter in the sort command line as it has 
special meaning to the shell. Finally 
the +0 and —1 options in the first sort 
command tell sort to skip no fields to 
get to the sort key and skip one field 
to get to the end of the sort key; in 
other words, sort on the first field. 
Looking at Example 1 again, you can 
see that the telephone number is the 
first field in the PHONE.BILL file. 

In the sort example, the +1 and —2 
options are used. This directs sort to 
skip one field to get to the beginning 
of the sort key and two fields to get 
to the end of the sort key. In other 
words, sort on the second field. Look- 
ing at Example 1, you can see that the 
telephone number is the second field 


in the PHONE.LIST file. 

Once the two files are sorted into 
telephone number order the match 
operation can be performed. This is 
done by join. The field separator char- 
acter is specified with the —t option, 
the same as with sort. The -j1 2 op- 
tion instructs join to perform the match 
on the second field in the first file. 
By default, the first field is used so 
the match specified in this command 
is the first field of the second file with 
the second field of the first file. The 
—a2 option tells join to print any lines 
from file 2 that have no match in file 
1. File 1 is the PHONE.LIST file and 
file 2 is the PHONE.BILL file, so join 
will print any lines that appear in the 
phone bill that are not in the name/ 
phone list. 

Example 3 shows the output of join. 
In the script this is used as input to 
the awk program that prints a format- 
ted report, but this processing could 
have been done with a program in 
any language. Note that the output 
is three fields for the lines that appear 
in both files, but two fields for those 
phone numbers on the bill that are 
not in the phone list. This is how the 
report program can determine if the 
number is known. In my report pro- 
gram I print ???? for the name if it is 
unknown. 

The data from the phone bill can 
be entered using a text editor. In fact, 
you may already have the phone list 
in this form. If not, awk can probably 
be used to reformat it. The end result 
is a workable system with no soft- 
ware purchases and about an hour 
of work. Oo 


Did you find this article particularly useful? 
Circle number 2 on the reader service card. 


Example 2. Output produced by running the script file 


$ smr 
phone bill summary 


Name or Company 
Micro/Systems Journal 
UNIX User 

2222 
tecNICA 


total bill is $12.46 


§ 


number 

201-522-9347 
213-372-9917 
213-555-1212 
415-848-0292 


Example 3. The output produced by join 


$ join -a2 -t’|’ -jl 2 phone.list phone.bill 
201-522-9347|Micro/Systems Journal|1.68 
201-522-9347|Micro/Systems Journal|/4.15 


213-372-9917|UNIX User|3.71 


213-555-1212] .75 


415-848-0292|tecNICA|2,17 


§ 
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Bring the 
Conveniences NR: An Implementation of the 
pP 
of U N H X To YOU R aie Word Processor 
MS-DOS NR is a text formatter that is written in C and compatible with 


UNIX's NROFF. Complete source code is included in the NR 
package so that it can be easily customized to fit your needs. NR 


Vi h 7 also includes an implementation of how -ms works. NR does hy- 

ac i ne phenation and simple proporational spacing. It supports automa- 

tioc table of contents and index generation, automatic footnotes 

Full Source Code on Disk! and endnotes, italics, boldface, overstriking, underlining, and left 
a and right margin adjustment. NR also contains: 


e Extensive macro and sting capability 

¢ Number registers in vaious formats 

¢ Diversions and diversion traps 

e Input and output line traps 
NR is easily conmfigurable for most printers. Both the ready-to-use 
program and full source code are included. For PC compatibles. 


Manual & Disk (MS-DOS) Item #33-X $29.95 


Receive On Command, Util and NR together for only 


On Command: Writing a UNIX- $85.95! You get all the convenience of UNIX-like fea- 
. tures and save 15%. 
Like Shell for MS-DOS 


by Allen Holub UNIX-like Features Package Item #167 $85.95 


This book and ready-to-use program demonstrate how to write a 
UNIX-like shell for MS-DOS, with techniques applicable to most 
other programming languages as well. The book and disk include a 
detailed description and working version of the Shell, complete C 
source code, a thorough discussion of low-level MS-DOS interfac- 
ing, and significant examples of C programming at the system 
level. 


To Order: Retum this order form with your payment to: 


M&T Books, 501 Galveston Drive, Redwood City, CA 94063 
Or, CALL TOLL-FREE 800-533-4372 Mon-Fri 8AM-5PM (In 


Supported features include: read, aliases, history, redirection and CA call 800-356-2002) 


pipes, UNIX-like command syntax, MS-DOS compatible prompt Name 

support, C-Shell based shell scripts, and a Shell variable that 

expands to the contents of a file so a program can produce text Address 

that is used by Shell scripts. Cisse ie en eee ote Zip 


The ready-to-use program and all C source code are included on 


I eee 
disk. For IBM PC and direct compatibles. LIVES! Please send me the following: 


UNIX-like Features Package $85.95 


Book & Disk (MS-DOS) Item #29-1 $39.95 On Command book & disk $39.95 
= /Util manual & disk $29.95, 
/Util NR manual & disk $29.95 
by Allen Holub Subtotal 
CA residents add sales tax % 


When used with the Shell, this collection of utility programs and 
subroutines provides you with a fully functional subset of the UNIX 
environment. Many of the utilities may also be used independently. 
You'll find executable version of the cat; cp; date; du; echo; grep; 
Is; mkdir; mv; p; pause; printevn; rm; rmdir; sub; and chmod. 


Add $2.95 per item for shipping 
Total 
A Check enclosed. Make payable to M&T Publishing. 
Charge my: CL] visa Limyc C] am.Ex. 
Card No. Exp.Date 
Signature 


The /Util package includes complete source code on disk. All 
programs and most of the utility subroutines are full documented in 
a UNIX-style manual. For IBM PC and direct compatibles. 
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Manual & Disk (MS-DOS) Item #12-7 $29.95 


Technical Brief 


OS/2 or UNLX? 


How are OS/2 and UNIX the same and 
where do they differ? 
And what do you have to be concerned with 
to port DOS applications 
to these multitasking platforms? 


by William G. Wong 


S/2 is taking developers by storm, although 

it may rain for awhile before we see applica- 

tions that will make users want to buy 
OS/2. And even though UNIX has been available on 
PCs for a much longer time, users are not flocking 
to that platform in droves either. Both OS/2 and UNIX 
have great potential, both provide advanced features 
for the user and programmer, and both have yet to 
really fulfill their potential. 

The main thing holding back OS/2 and UNIX right 
now is DOS (PC-DOS and MS-DOS). DOS’s popularity 
has been brought about by the abundance of applica- 
tions and low-cost PCs. Both OS/2 and UNIX require 
a great deal of resources compared to DOS. The 
normal low-end platform consists of an 80286 PC with 
2 megabytes of memory and a 30-MB hard disk. A 
more suitable platform is an 80386 with 4 MB of 
memory, a 60-MB hard disk, a tape backup unit, and 
an intelligent communications controller. OS/2 is pri- 
marily designed to run a system as a node within a 
local area network or attached to a mainframe or 
mini. UNIX can be used as a single-user system, but 
multiuser configurations are more common. 

Version 1.0 of OS/2 provides a DOS-like command- 
line interface. UNIX provides a shell that is also a 
command-line interface. Actually, the two are quite 
similar in function and really only differ in syntax. 
However, OS/2’s similarity to DOS gives it an advan- 


William Wong is president of Logic Fusion, Inc., a 
systems software development firm located in Morris- 
ville, Pennsylvania. 
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tage with DOS users. 

The program interfaces for OS/2 and UNIX are also 
similar in conventional system and file operations, as 
well as multitasking facilities. These similarities be- 
come a major issue when considering design of new 
applications and porting of existing DOS applications 
because both OS/2 and UNIX provide potential plat- 
forms for new applications. 

This article addresses the similarities and differ- 
ences between UNIX, OS/2, and DOS starting with the 
user interface. The general programming interface 
for OS/2 and UNIX will be compared, as well as the 
implications of graphics front ends (Presentation Man- 
ager for OS/2 and X Windows for UNIX). Finally, the 
issue of porting DOS applications to OS/2 and UNIX 
will be addressed. 


User Interface 
The infamous A> prompt is the hallmark of DOS, even 
though it did not originate there. (COMMAND.COM, 
the program that provides the DOS command-line 
interface, actually allows the prompt to be changed.) 
COMMAND waits for a user to enter a command that 
it then executes. Commands are either built into 
COMMAND programs (.COM and .EXE files) or batch 
files. COMMAND.COM can actually be replaced and 
there are a number of products that provide either 
an extension to COMMAND.COM or provide a menu 
interface. However, none of these programs has even 
made a dent in the number of systems using COM- 
MAND.COM. The only one that comes close is Micro- 
soft Windows, which does so for different reasons. 

(Continued on page 60) 
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Operating System Review 


UNLX on the PC 


A comparative look at SCO Xenix, Microport 
System V/386, and Interactive Systems 386/ix — 
three UNIX alternatives for the PC. 


“As with most sophisticated computer 

systems, panics will occasionally 

occur; they should not cause much 
concern,” 

The UNIX System 

Administrator’s Guide 


\ . ith those words of reassur- 


ance, let me add that the readers of 
this review will probably be divided 
into two camps: those who credit me 
with divine infallibility and those who 
don’t believe a word I say. To the first 
group I say, “No one is infallible,” and 
to the second group I say, “I tried.” 
Documentation for a UNIX system is 
variously 5000 to 10,000 pages, while 
typical binary distributions range in 
size between 15 and 30 megabytes, 
although those considered here are 
under 20 MB. For the next thousand 
years, scholars will argue about what 
constitutes a UNIX system and what 
is SVID (System V Interface Defini- 
tion) compatibility. Consider the fol- 


Bob Morein is founder of Automata 
Design Associates, specializing in Prolog 
language systems for DOS and UNIX. 
He holds a BS in engineering physics 
from Lehigh University, an MS in phys- 
ics from Temple University, and is work- 
ing for a doctorate in electrical engi- 
neering at Drexel University. In addi- 
tion to UNIX, he is interested in neural 
networks and digital design. 
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lowing myth: “The UNIX porting base 
provided by AT&T is truth, beauty, 
and perfection, and the porting ven- 
dors muck it up and make it buggy.” 

In fact, the porting base is full of 
bugs, which are only gradually being 
discovered by AT&T (they are human 
beings, after all!). It is the job of the 
vendors to fix the bugs of their re- 
spective ports, something that con- 
sumes a major portion of their time. 
For this reason we have not consid- 
ered bargain-basement ports, which 
are supplied as fixed releases, but 
only those that are supported and 
improved by a vendor. These include 
the Santa Cruz Operation (SCO), Mi- 
croport, and Interactive. The hard- 
ware used was a pair of essentially 
identical AT-compatible 10-MHz ma- 
chines, the Intel Inboard 386 with a 
math coprocessor, and a total of 4 
MB of memory (1 MB of 16bit, 1 
AT bus wait-state and 3 MB of 32-bit, 
1 wait-state). A 386 UNIX running on 
this platform provides performance 
equivalent to the Sun 3/60, which is 
a 20-MHz 68020 machine that runs 
the vaunted FFS (Berkeley Fast File 
System). 

Until recently there has been no 
universal benchmark for file system 
performance, but Arvin Park of Prin- 
ceton University has now provided 
that in the form of the “iostone,” which 
will join the whetstone and dhrystone 
in the pantheon of benchmarks. It’s 
interesting to note that, with a stan- 
dard Western Digital ST506-type con- 


troller (WD1002-WA2) and Micropo- 
lis 1325A 70-MB disk drives, two of 
the ports were able to exceed the file 
system performance of the Sun and 
come very close to that of the VAX 
8600. With a more modern controller, 
of which there are several, perform- 
ance might move beyond the VAX and 
the deluxe Sun/260 into the area pre- 
viously reserved for mainframes. 
There is no limit to the amount of 
possible disk storage since National 
Memory Systems (not to be confused 
with National of Japan) has an SMD 
(storage module device) controller 
with drivers for all of these systems 
that has a data transfer rate of 3 MB 
per second. It can handle up to 4 SMD 
drives, with a practical maximum of 
1.44 gigabytes per drive. The only 
limit on the number of controllers is 
power and slot space. 

I’m appalled at the popularity of 
bad benchmarks. But what makes a 
good benchmark? Ideally the num- 
ber should strongly correlate with 
the performance of a class of applica- 
tions. Inventors tend to neglect this 
fact, preferring to proliferate tables 
that compare the speed of some iso- 
lated function in the kernel. Readers 
are left with a pile of figures, and 
must conduct their own “benchmark 
of benchmarks” to determine which 
ones, or what combination with what 
weighting, corresponds to their 
applications. 

I believe that the dhrystone ade- 
quately represents CPU-bound per- 
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formance for non-floating-point ap- 
plications; the whetstone gives math 
speed; and the iostone satisfactorily 
imitates the disk activity of the aver- 
age installation. The iostone was com- 
piled by statistical analysis of disk 
activity at several sites, just as the 
dhrystone is the result of statistical 
analysis of program structure. We 
don’t want language-independent 
benchmarks, since virtually all UNIX 
applications are written in C, and the 
compiler is as important as the proc- 
essor in predicting system speed. 

If you will examine the charts, the 
surprising conclusion of this article 
is that the AT hardware is good for a 
lot more than critics say, and when a 
386 chip is used, you have a VAX- 
killer capable of replacing the VAX 
8600 series in departmental comput- 
ing. Further, with the UNIX System 
V, Release 3, innovation of RFS (re- 
mote file system) machines in a net- 
work can transparently mount and 
use file systems in any other machine 
also running RFS. So a distributed main- 
frame could be constructed that would 
replace the heaviest IBM hardware. 

All of these systems are roughly 
equivalent in function. It is practically 
impossible to compare every feature 
since there are hundreds of utilities, 
a dozen or more languages, and thou- 
sands of functions, and any one of 
them taken in isolation is not terribly 
important. But every release of UNIX 
contains major architectural features. 
Here readers may have to make hard 
choices. For example, the STREAMS 
facility is a significant improvement 
in the use of character I/O for inter- 
process communication. Applications 
such as X-Windows exist that require 
STREAMS or Berkeley sockets to run, 
and there is no satisfactory substi- 
tute. In practice, the reader may con- 
clude that the reliability and perform- 
ance of these systems has greater 
significance than the superiority of 
any particular feature. 


SCO Xenix 

There are about 400,000 UNIX sys- 
tems in the world to date. Of these, 
200,000 use Xenix, and the rest use 
BSD, ULTRIX, System V, and other 
things. SCO has sold 50,000 Xenix 
licenses, and is currently shipping 
2500 systems a month. Xenix-286 has 
been one of the mainstays of small- 
office systems, and the 386 version 
can do more. All SCO Xenix licenses 
are currently of the unlimited user 
type; other UNIX systems typically 
provide a two-user login limit with 
unlimited privileges available at extra 
cost. The system is robust and insen- 
sitive to hardware variations. This is 
important because we have only a 
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vague notion of what “386-AT” means, 
since there is no corresponding IBM 
model. Each machine has subtly dif- 
ferent hardware characteristics which 
become known in an unsubtle way 
—you try it, for example, and it doesn’t 
work. But SCO rises above this for 
the most part, except for disk control- 
ler problems. (They tell me the West- 
ern Digital WAH doesn’t work, but 
the WA2 does. Why?) RLL and ESDI 
are also supported, and a custom disk 
formatter handles drives not in the 
AT ROM table. The DTC RLL control- 
ler isrecommended, since that’s what 
SCO uses in-house; it gives an inex- 
pensive boost if you cheat with high- 
quality, non-RLL drives. 

The documentation is a marvel. 
They are the best books written on 
UNIX anywhere —-totally unpreten- 
tious (no new-speak!) and extremely 
complete, with two chapters on writ- 
ing device drivers. Eight binders con- 
tain the Imagen laser printout, which 
you could do yourself with the text- 
processing system. Read it, try it, and 
read some more. Then refer to the 
manual pages, which are supplied on- 
line (a very important feature for a 
school or remote use). There is com- 
plete conformance to the software, 
since the documentation is done by 
the same people. And they never stop! 
In the space of six months I’ve seen 
significant expansion and revision. 

There is complete downward com- 
patibility, to the extent that much of 
the software has been left as 286 code, 
but it’s fast. The Microsoft C com- 
piler has a 2:1 speed advantage over 
the Portable C Compiler, and the code 
is much smaller. The object file for- 
mat is not compatible with AT&T 5.3, 
but the next release will have COFF 
(common object file format) as a first 
step toward the 386-merged UNIX that 
will embrace and supersede the cur- 
rent products. Operations are ex- 
tremely well organized, and the de- 
gree of technical support provided 
may not be equalled by any other 
company in the software industry. 

Chief among concerns is to what 
extent Xenix is UNIX. It would be 
foolish to argue that it is not, since 
the kernel passes SVID. The utilities 
do not pass, however, which means 
there are operational differences—all 
to the better. More importantly, SVID 
conformance does not require that 
all components of 5.3 be implemented, 
and they are not. Networking is possi- 
ble, but it is heterogeneous, with spe- 
cial utilities for file transfer and log- 
ging into remote systems. The 
STREAMS feature is not available as 
of this writing, but a form is sched- 
uled for Release 2.3. It will permit 
local applications to run, but RFS will 


not appear until the 3.2 edition of the 
merge product. 

Multiview, available at extra cost, 
is a very interesting character-based 
windowing system, shell, and exten- 
sion to the operating system. It can 
project onto every compatible termi- 
nal in the system up to six windows 
of adjustable size. 


Microport System V/386 

The Microport System V/386 is the 
most complete UNIX distribution. As 
with Interactive’s 386/IX, it is a direct 
product of the UNIX System 5.3 porting 
base, and is almost completely docu- 
mented by the AT&T UNIX manuals. 
What distinguishes one port from an- 
other is the quality of the basic driv- 
ers, which are roughly equivalent to 
the system BIOS, the completeness 
of the distribution, and features or 
utilities beyond those supplied with 
the porting base. Every component 
of V.3 is available from Microport, 
including the Network Extensions 
which comprise RFS and STREAMS. 
The price of the complete 5.3 docu- 
mentation from AT&T is $780, but you 
receive most of that, and a two-user 
“limited” version of 5.3 as well, for a 
typical retail price of just $700. An 
unlimited user version is available at 
considerably greater cost. The sup- 
plied documentation includes: 


© Product Overview 

User’s Guide 

User’s Reference Manual 

System Administrator’s Reference 
Manual 

Programmer's Guide 
Programmer's Reference Manual 
Documenter'’s Workbench Software 
User's Guide 

Documenter’s Workbench Software 
Technical Discussion and Reference 
Manual 

Documenter’s Workbench Software 
Handbook 

Documenter’s Workbench Software 
Handbook for New Users 


It’s a pity they left out the Administra- 
tor’s Guide, but you can obtain it, as 
well as the rest of the manuals, from 
AT&T, or better yet, you can get the 
excellent Prentice-Hall edition from 
bookstores. And if you wish to com- 
plete your education, consider add- 
ing: 


¢ Network Programmer's Guide 

© STREAMS Primer 

© STREAMS Programmer's Guide 

© Utilities Release Notes (they charge 
you for the bug reports) 


The binders are handsome and will 
complement your library. It’s ashame 
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that divider tabs are not supplied. 
For UNIX aficionados there are ir- 
resistible attractions: 


e The Korn Shell, a much-improved 
shell resembling a structured pro- 
gramming language, replete with 
array data types. 

e The Green Hills “C” Compiler, 
known as a benchmark champion. 
If you must produce highly opti- 
mized COFF output, there is no 
other way at present. It would have 
done even better in the dhrystone, 
but the “-O2” switch was appar- 
ently not implemented. 


It is well known that Microport is 
working hard on implementations of 
X-Windows and GNU. There is only 
very limited downward compatibility 
with System V/286, since loading of 
Intel .OMF (object module format) 
files is not supported. 

Alas, Microport suffers from the 
travails of youth. Operations are iffy, 
so while shipment of new orders is 
fairly prompt, purchasers of support 
contracts frequently face delays of 
several months in receiving updates. 
Release dates tend to be so unrealis- 
tic as to be deliberately exaggerated. 
Support lines are frequently tied up, 
with promised callbacks often never 
being made or scheduled. A strong 
plus is the company’s new bulletin- 
board system policy which allows any 
registered Microport user to down- 
load bug fixes without charge, even 
in the absense of a service contract. 

The system is quite touchy with 
respect to hardware, but it is likely 
that by the time you read this article 
there will be significant improvements 
in fault recovery and hardware flexi- 
bility. 


Interactive Systems 386/ix 

The 386/ix operating system was 
tossed softly into my lap by UPS. Usu- 
ally you don’t toss UNIX documentation 
around. What was it, a credit memo? 
I eagerly ripped open the carton. 
Within were three small and shiny 
printed boxes obviously designed for 
retail display. The box of disks could 
be inverted and propped up for dis- 
play, like the packaging for a Timex 
watch. Three small D-ring binders 
emerged. A faint smile appeared on 
my lips. “At last,” I thought, “some- 
body’s read the documentation and 
isolated the important stuff.” As I 
flipped through the binders and saw 
only end-user documentation, another 
pleasant thought flitted past. “It’s on 
disk! It’s on disk?” The truth emerged 
when I read “Roadmap to the 
Documentation,” which tells you 
where to buy the complete documen- 
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tation package. 

I had not expected Interactive, 
which did the original 386 port, to 
embark on a brave experiment in prod- 
uct packaging, but that was precisely 
what the company did. Simple mathe- 
matics revealed that, at some point, 
there would be more UNIX users than 
developers. And who would sell them 
licenses? Interactive would, by repack- 
aging UNIX in a new menu-driven 
shell (with new terminology), a new 
editor, and a strictly hierarchial, a la 
carte approach to documentation that 
would reduce delivery costs and user 
confusion. The standard distribution 
has the two-login limitation, with an 
unlimited user version available at 
extra cost. 


SERIOUS DEBUGGING 
AT A REASONABLE PRICE 


Z 


All the speed and power of a 
hardware-assisted debugger at a 


software price 
Hardware-level break points 


In a sense, the Interactive product 
defines System 5.3 on the 386, and 
the AT&T documentation is directly 
and completely applicable. The im- 
portance of this should not be exag- 
gerated; the result of the porting con- 
tract was quite bare and undebugged, 
and has been subject to elaboration 
by AT&T, Microport, and Interactive 
as well. 

So if you’re a developer, you sepa- 
rately purchase the excellent Prentice- 
Hall edition (ask for the 386 edition, 
not 3B2) of the System V documenta- 
tion, and consider purchasing addi- 
tional port-specific documentation 
from Interactive as well. Since the 
software and the hardware have been 
through many different hands, the 


RUN CODEVIEW 
IN ONLY 8K 


MagicCV 


CodeView is a great integrated 
debugger, but it uses over 200K of 
conventional memory. MagicCV 
uses advanced features of the 80386 
microprocessor to run CodeView ina 
separate virtual machine in 
extended memory. This allows 


REAL-TIME break points on memory locations, 
memory ranges, execution, I/O ports, hardware 
and software interrupts. More powerful break 
points than ANY software-only debugger on the 
market. 
Break out of hung programs 

With a keystroke - no external switch necessary. 
Even with interrupts disabled. 


Breaks the 640K barrier 
Soft-ICE uses ZERO bytes of memory in the first 
1MB of address space. 

Works with your favorite debugger 
Soft-ICE can be used as a stand-alone debugger or 
it can add its powerful break points to the software 
debugger you already use. 

Solve tough systems problems too 
Soft-ICE is ideal for debugging TSRs, interrupt 
handlers, self booting programs, DOS loadable 
device drivers, non-DOS operating systems and 
debugging within DOS & BIOS. Soft-ICE is 
also great for firmware development because 
Soft-ICE’s break points work in ROM. 

How Soft-ICE Works 
Soft-ICE uses the power of 
the 80386 to surround your 
program in a virtual 
machine. This gives you 
complete control of the 
DOS environment, while 
Soft-ICE runs safely in 
protected mode. Soft-ICE 
uses 80386 protected mode 
features, such as paging, I/O 
privilege level, and break 
point registers, to provide 
real-time hardware-level 
break points. 


MagicCV 
Soft-ICE 


“Itis a unique, effective solution to a wide 
range of critical debugging problems" 
COMPUTER LANGUAGE -- April 1988 


Buy Both and SAVE $86 


(603) 888 - 2386 
Nu-Mega Technologies 
Nashua, NH 03060-7607 


30 day money-back guarantee 
Visa and Master Card accepted 
Both require 80386 PS/2 or AT compatible 


MagicCV to run CodeView using less 
than 8K of conventional memory on 
your 80386 PC. 

Don’t let 640K be your limit! 
If you are closing in on the 640K limit 
and would like the power of 
CodeView, MagicCV is for you. 

Find those hidden bugs! 
Even if you’re not closing in on the 
640K limit, running CodeView with 
MagicCV makes your debugging 
environment much closer to the end 
user’s program environment. You 
can use CodeView to locate subtle 
bugs that only occur when there is 
plenty of free memory. 

MagicCV is easy to use 
If you are a CodeView user, you 
already know how to use MagicCV 
too. Just MCV instead of CV, 
everything else is automatic. 


MagicCV with Soft-ICE 

Using Soft-ICE 
with CodeView 
gives you the 
features 
necessary for 
professional level 
systems 
debugging. 
MagicCV and 
Soft-ICE can 
work in concert 
with CodeView 
to provide the 
most powerful 
debugging 
platform you will 
find anywhere. 
As an extra bonus, by ordering both 
MagicCV and Soft-ICE together 
you SAVE $86. 

Codeview is a trademark of Microsoft Corp. 


$199 
$386 


PO Box 7607 


CIRCLE 103 ON READER SERVICE CARD 


MIcro/SYstEMS 23 


Table 1. A comparative look at UNIX performance on the 386 


Dhrystone Benchmark Results 
Microport/ Microport/ 
GCC 

4147 


Whetstone Benchmark Results 


SCO/ 
Microsoft 
6020 


Interactive 
2649 


single double 
549 555 


single double 


single double | single serie 
724 826 


729 819 | 1234 1388 
Note: Double precision really was faster. 


Comparison Figures 


Manufacturer/Model Dhrystones Whetstones 
Sun 3/60 (20 MHz 68020) 4273 
Sun 3/280 (25 MHz 68020, cache ) 6329 
DEC VAX 11/785 2090 
DEC VAX 8600 3866 


IBM PC/AT, large model approx 200 


Iostone Benchmark 


Vendor Interleave Disk Buffers Iostones 
SCO 2:1 537 2072 
SCO 2:1 1000 2259 

Microport PAB 1100 1049 
Microport ab 1100 1156 
Interactive 2:1 600 1990 


Comparison Figures 


Sun 3/60 local disk, 8 MB of RAM 1834 
Sun 3/60 Ethernet to Sun 3/280 server 1176 
Sun 3/280 local SMD disk—Fuji Eagle 3669 
Sun 3/280 Ethernet to Sun 3/60 1133 


Espresso Compilation 


Interleave Disk buffers C&L 


Vendor 


SCO 2:1 537 4:04 5.1 27.5 
Microport 3:1 1100 5:02 - 1:37.4 
Interactive 2:1 600 4:34 4.3 1:36.4 


Comparison Figures 


Sun 3/60 8 MB of RAM, 800K buffers 4:04 - - 
Sun 3/280 3:54 ~ - 
Apollo - - 
DN3000 8:27 - - 


C = Compile 
L = Link 
cat = cat source files to screen 


Load Degradation—Espresso Compilation 


Number of compilations running concurrently 


1 2 3 4 
Vendor Real time per compilation 
SCO 4:10 7:46 11:22 15:31 
Microport 5:14 11:13 15:58 22:44 
Interactive 8:56 12:42 
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tight integration found in the SCO 
documentation is absent. Although 
the AT&T documentation is markedly 
better now than in the past, the cover- 
age is still uneven, with an impractical 
bent and a tendency toward super- 
natural jargon: “Insert the medium 
in the disk drive.” The Interactive 
documentation is expensive but well 
written, and in fact, programming with 
the company’s window-like user in- 
terface cannot be done without it. 

Interactive has promised that pur- 
chasers of the software development 
system will receive the Programmer's 
Guide and the Programmer's Refer- 
ence Manual, and a very nice small 
book they have authored about writ- 
ing device drivers. It’s probably the 
last word, since they ported the ker- 
nel, and in conjunction with Writing 
a UNIX Device Driver (Janet I. Elan 
and Thomas J. Teixeira, Wiley, 1988), 
it puts the lie to the notion that you 
have to have the kernel source to 
know what you're doing. 

The port acquitted itself quite well 
in performance, although there were 
some embarrassing bugs and some 
hardware sensitivity. Also, perhaps 
it did not like having an enlarged 
process table. It seems that nobody 
knows how to format a disk these 
days. The surface analysis program 
repeatedly died when confronted with 
bad sectors, and the adddisk program 
terminated without warning or suc- 
cess when a second hard drive was 
installed. Sometimes a low-level for- 
mat of the entire disk would fix this, 
but toward the end of my repeated 
trials, it failed entirely. Interestingly, 
Microport admits thatits surface analy- 
sis is buggy, but in fact it did reason- 
ably well. Only SCO can edit the bad 
track table, handle nonstandard disks, 
find bad sectors with any reasonable 
certainty, and recover data from bad 
sectors in an existing file system. And 
both Microport and Interactive in- 
sisted on destroying the entire con- 
tents of the disk, rather than remain- 
ing within their own partitions. Mi- 
croport and Interactive map defects 
on the sector level, but allow only 62 
bad sectors per drive. Since there can 
be as many as 17 bad sectors on a 
single track, this scheme is only mar- 
ginally capable of handling large hard 
disks. 

Retail is new to Interactive. It was 
clear when I called that the switch- 
board operator did not know what 
technical support was because I was 
repeatedly connected to someone in 
janitorial, who didn’t know either. It’s 
possible I was the first caller; eventu- 
ally the call got through to some hap- 
less individual sitting in a dark room 
with his feet in a bucket of ice water. 
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Simply the BEST C and 
Pascal on AT, 386, Sun, 
Apollo, RT, VAX, 370 


“The most rock-solid C compiler in the industry. Superb technical 
support and portability. Superior code generated.” 

Gordon Eubanks, Symantec — Q&A (386). 

“It simply works, with no trouble, no chasing strange bugs, and ex- 
cellent warning and error messages ... a professional product.” 

Robert Lerche, Bay Partners. 

“For large-scale software development, the highest quality C compiler 

available on the market today. Pragmas are great. Quality of 

support is exceptional.” Randy Neilsen, Ansa—Paradox (D0s,0S/2). 


“15% smaller and 15% faster than Lattice C.” 

Robert Wenig, Autodesk. 
“Our software is running anywhere from 30 to 50% faster than when 
compiled under Lattice.” David Marcus, Micronetics. 
“We switched from Lattice due to a 10% reduction in code size. The 
compiler is very stable.” Lee Lorenzen, Ventura Software 
— Ventura Publisher, marketted by Xerox Corp. 
“Best quality emitted code by any compiler I've encountered. Often a- 
mazing.” Bill Ferguson, Fox Software — FoxBase (386). 
“Messages sometimes pointed out type mismatches, incorrect-length 
argument lists, and uninitialized variables that had been undetected 
for years [in 4.x bsd].”. Larry Breed, IBM ACIS [RT PC]. 
“Diagnostics turned up bugs missed by other compilers. Rapid bug 
fixes by technical support, someone who knew what he was 

talking about. 80386 code is well optimized.” 
Tim Addison, Logistics Data Systems. 
“386 protected mode support is fantastic, especially the access to 
large amounts of memory. It's mainframe compute power on a 
PG;” Dan Eggleston, Viewlogic. 
“The preprocessor supplied with Professional Pascal is quite useful. 
The code quality and control over segmentation and memory mod- 
els are superior to MS Pascal.” Bob Wallace, QuickSoft. 


Check Out These Reviews 

* High C ™; 
Computer Language 
Dr. Dobb's Journal 


February 1986, '87 
August 1986 


PC Magazine Jan. 21; 1987 (80386 version) 

Dr. Dobb's Journal July 1987 (80386 version) 

BYTE Magazine November 1987 — (80386 version) 
¢ Professional Pascal ™: 

PC Magazine Dec. 29, 1985 

Computer Language May 1986 

PC Tech Journal July 1986 


Journal of Pascal, Ada, & Modula-2. Nov.-Dec. 1986 
BYTE Magazine Dec. '86, June '87 (80386 version) 


Why MetaWare compilers 


¢ They are specifically designed for serious software developers. 

¢ They are reliable and robust: they don't break at every turn. 

¢ Their generated code is the best, or near best, on each architecture. 

¢ Their superior diagnostic messages help you produce better prod- 
ucts more quickly. 

* Your source can be ported with ease to the most popular systems. 

¢ You can link mixed-language modules from our compilers, others 

¢ You can benefit from high-level, personal technical support. 

* You can take advantage of the latest ANSI C extensions, and/or 
extensive Pascal extensions. High C has been tracking the ANSI 
Standard for two years; Professional Pascal will soon have a 
VS Pascal compatibility switch and several Apollo Pascal ext'ns. 

* You can take advantage of the latest 387 and Weitek 1167 support 
— we have the only compilers with Weitek real mode support. 


Power Tools for Power Users 


Ashton-Tate: dBase III Plus, MultiMate; Autodesk: AUTOCAD, AU- 
TOSKETCH (8087, '387, Weitek); Boeing Computer Services (Sun); 
CASE Technology (Sun); CAD/CAM giant Daisy ('86, '386, VAX); 
Deloitte Haskins & Sells; Digital Research: FlexOS; GE; IBM: 
4.3/RT, 4680 OS; Lifetree Software (Pascal): Volkswriter Deluxe, 
GEM-Write; Lugaru: Epsilon; NYU: Ada-Ed cmplr; Semantec: Q&A; 
Sky Computers; ... (Product names are trademarks of the companies indicated.) 


Industrial-Strength 


MetaWare C and Pascal compilers are designed for professional soft- 
ware developers. These tools are loaded with options to control 
them for special purposes. You can adjust the space-time trade-off 
in code quality. You can adjust external naming conventions to 
agree with linkers and operating systems. You can specify segment 
names for segmented architectures, and to help place code or data in 
particular places for embedded applications. You can select from 
five memory models for the 8086 family. And on and on. 


A Partial List of Optimizations 


Common subexpression and dead-code elimination, retention and re- 
use of register contents, jump-instruction size minimization, tail 
merging (cross jumping), constant folding, short-circuit evaluation 
of Boolean expressions, strength reductions, fast procedure calls, au- 
tomatic mapping of variables to registers (where advantageous), ... 


“Platform” — Code Quality 


Sun,Apollo,SGI— 18%, 3%, 26% > resident compiler (Dhrystone). 
PC: DOS, OS/2 — 3-10% > Microsoft C; 30% > MS Pascal, LatticeC. 
386 32-bit DOS— no competitors, since November, 1986. 

286, 386 UNIX — 66% better than pcc (Dhrystone, 386). 

VAX VMS — = DEC's excellent C and Pascal; better features. 
VAX Ultrix — 19% > pce (Dhrystone); much > Berkeley Pascal. 
RT PC/4.3bsd — 89% > the original port of pec (Dhrystone). 

370 CMS,UNIX— much better than any C, and VS Pascal. 

AMD 29000 — >40,000 Dhrystones! Available in Q2, cross. 


(408) 429-6382, telex 493-0879. Since 1979. 


Mats AN, Ws" 


INCORPORATED 
903 Pacific Avenue, Suite 201 ¢ Santa Cruz, CA 95060-4429 


The Clear Choice for Large 
Programming Projects — <1... 


© 1987 MetaWare Incorporated. MetaWare, High C, Professional Pascal, and DOS Helper are 


trademarks of MetaWare Incorporated. Others and their owners are duely respected. 
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(I did receive a nice callback later.) 
The people at Microport are more 
convivial, since apparently the phone 
system there causes every curious 
person to pick up the call, see if it 
pertains to him, and if not, transfer it 
randomly to some other person— 
until the switchboard operator pulls 
the plug. 


The Benchmarks Revealed 

The dhrystone (Table 1) was origi- 
nally coded in Ada by Rheinhold 
Weicker and converted to C by Rick 
Richardson. In each loop, 99 state- 
ments are executed: 52 assignments, 
32 control, and 15 procedure calls. 
Version 1.1 was used and compiled 
with all available compiler optimiza- 
tions. The result is given in number 
of dhrystones, or loops, per second, 
so the higher the better. Floating- 
point arithmetic is not used, so it is 
rather amusing to see reviewers quot- 
ing results with and without math 
coprocessors. When the same hard- 
ware is used to compare different 
compilers it becomes principally a 
test of code quality, since the kernel 
overhead is small, even in the worst 
of ports. Running multiple simultane- 
ous dhrystones gave rise to an inter- 
esting observation: The degradation 
per process was almost exactly the 
inverse of the number of processes; 
the whole was approximately equal 
to the sum of the parts. 

The iostone is described as “a syn- 
thetic file system performance bench- 
mark.” The authors discovered ap- 
parently universal laws, such as the 
approximate 2:1 ratio of reads to writes 
over many installations, and analyzed 
the size and frequency of blocks trans- 
ferred. The iostone performs neither 
strictly sequential nor random access 
but models a continuum, from very 
short blocks to 64K in a single trans- 
fer, all according to the empirical data. 
All operations occur within a single, 
temporary file. Although I have not 
tabulated performance of concurrent 
iostones (owing to the difficulty of 
getting sufficient disk space), degra- 
dation was severe. 

The whetstone measures the qual- 
ity of the particular code for the 80387. 
Some systems, notably SCO Xenix, 
suffer because they generate 80287- 
compatible code. The 80386 copro- 
cessor bug, not to be confused with 
the 32-bit multiply bug, tends to cause 
the sudden death of the operating 
system; this is a fix the OS people 
would prefer to leave to hardware, 
although there is, in theory, a way 
around it using software. In fact, SCO 
Xenix Version 2.3 has addressed this 
coprocessor bug with a workaround 
option at bootup. This solution, how- 
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ever, adversely affects coprocessor 
performance and does not work with 
all hardware. 

The espresso benchmark is the most 
interesting of all, at least to the pro- 
grammer. It measures the time re- 
quired to compile and execute 200K 
of C source code for a public domain 
program developed at the University 
of California at Berkeley, which de- 
signs two-level programmable logic 
arrays from truth table specifications. 
This activity provides an ideal mix 
sequential read (file loading), compil- 
ing (compute-bound activity), linking 


The AT hardware 
is good for a lot 
more than critics 
say, and when a 
386 chip is used, 
you have a VAX- 
killer, capable of 
replacing the 
VAX 8600 series. 


(random access), and sequential write 
(writing the object file). Examination 
of the multiprocess results indicate 
that the overall system efficiency ac- 
tually increased with the number of 
processes running, indicating that the 
UNIX ports for the 386 manage re- 
sources efficiently compared to their 
predecessors. 

Another test is the “cat to the 
screen” of all the source files, which 
gives some notion of screen band- 
width. The composite results explode 
the notion that a Sun is the requisite 
tool for heavy programming. If it 
weren't for the small, windowless 
screen, the AT-386 type would be the 
workstation equal of the Sun, and this 
disadvantage is likely to be removed 
before the end of the year. 


Reliability 

Because the 386-AT is the invention 
of many different engineering minds, 
the problem of hardware compatibil- 
ity rears its ugly head. The three 
operating systems split extremely fine 
hairs in all departments. You will dis- 
cover this in all it’s glorious absurdity 
when you read that Microport DOS- 


merge does not work with MAXTOR 
drives, or that EGA boards, which 
function perfectly under DOS cannot 
be made to work with SCO CGI (dis- 
cussed later). To this we must add a 
CPU chip for which the specifications 
change frequently! Intel publishes pe- 
riodic “revision of specification” docu- 
ments and has actually deleted some 
instructions. The coprocessor bug 
caused by a design flaw in the 386 
chip is universally present and gener- 
ates a small probability of a system 
crash per coprocessor instruction exe- 
cuted. A new mask, not yet in produc- 
tion, is one of the possible solutions 
to this position. 

Reliability problems in hardware 
that contained the 80387 coprocessor 
were encountered with two of the 
operating systems, while identical hard- 
ware minus the coprocessor worked 
flawlessly. The system crashes oc- 
curred during the compilation of C, 
which does not use floating-point, and 
would not be expected to invoke the 
80387, although it’s possible that the 
long integer arithmetic involved in 
process accounting does. But the syn- 
drome exactly matched a typical copro- 
cessor crash: A job failed to termi- 
nate, and an attempt to start another 
job or kill the hung process brought 
down the system. This is not proof 
that the coprocessor was the culprit, 
but the system with the coprocessor 
was put under a heavy load running 
Xenix for 40 hours without mishap. 

Users who have the “right” hard- 
ware report that all of these operating 
systems operate reliably, at least for 
their activities, while others complain 
endlessly via USENET. Had I reported 
perfect operation of all the systems, 
I would have been reporting a mis- 
leading, fortuitous result. A disk con- 
taining the benchmarks used in this 
article is available so that the reader 
can attempt to validate a particular 
software/hardware choice. But be fore- 
warned that it does not address the 
ability of a system to handle serial 
input/output. USENET reports indi- 
cate that there is a substantial differ- 
ence in the ability of these systems 
to handle high-speed serial input. 


Innovations and Directions 

In times gone by, a device-independ- 
ent, character-based interface for cur- 
sor-positioning terminals was one of 
the great achievements of UNIX. But 
in a bit-mapped world, UNIX lovers 
must grin and bear it when they are 
told that simple DOS provides a better 
CAD platform. The screen in graphics 
mode is mapped to one application 
unless, of course, there is a window- 
ing system. None of the systems con- 
sidered offer a windowing system at 
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present, but this may change shortly. 
What is alarming, however, is that 
the range of graphics devices sup- 
ported is quite narrow compared to 
that supported by DOS. SCO offers a 
precursor to windowing called CGI, 
which is an implementation of the 
ANSI computer graphics interface. It 
offers the customary graphics primi- 
tives and font generation facilities—in 
one window—and the fanciest adap- 
tor supported is standard EGA. The 
next step would be X-Windows, a wor- 
thy public-domain windowing system 
that would have become the UNIX 
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standard were it not for the Sun/ 
AT&T romance, which culminated in 
an exchange of vows and 20 percent 
of Sun’s stock. This means that we 
will eventually receive, as part of Sys- 
tem 5.4, the extremely complicated 
system NEWS, or Network Extensi- 
ble Windowing System, which pur- 
ports to communicate via Postscript 
over networks. Early users at Sun 
reported that it consumed all of your 
RAM and most of your patience. 

The last gasp of centralized proc- 
essing has just appeared in the form 
of the Sun River fiber-optic worksta- 
tions. Priced at little more than the 
cost of a complete 386 machine, each 
station is a bit-mapped EGA or Hercu- 
les terminal connected by a 30- 


megabit (Mb) fiber-optic link to a 
single 386 machine, which must re- 
fresh up to four bit-mapped work- 
stations simultaneously. This must 
stand as the most significant develop- 
ment since the discovery of n—p com- 
plete algorithms. More logical would 
be the marketing of diskless 386 ma- 
chines, as is standard practice in the 
workstation market. 

We may conclude that the future 
of UNIX on the 386-AT class machine 
is bright. Unfortunately, I cannot de- 
termine whether the brightness is 
one of cheer or that of a display stuck 
in reverse video. oO 


Did you find this article particularly useful? 
Circle number 4 on the reader service card. 


Write Better 


Turbo 4.0 Programs... 
Or Your Money Back 


You'll write better Turbo Pascal 4.0 programs easier and faster 
using the powerful analytical tools of Turbo Analyst 4.0. 
You get * Pascal Formatter * Cross Referencer * Program 
Indexer * Program Lister * Execution Profiler, 
and more. Includes complete source code. 

Turbo Analyst 4.0 is the successor to the 


acclaimed TurboPower Utilities: 


“Tf you own Turbo Pascal you should own the Turbo 
Power Programmers Utilities, that’s all there is to it?” 
Bruce Webster, BYTE Magazine, Feb. 1986 


Turbo Analyst 4.0 is only $75. 
A Library of Essential Routines 


Turbo Professional 4.0 is a library of more than 400 state-of-the-art 
routines optimized for Turbo Pascal 4.0. It includes complete 
source code, comprehensive documentation, and demo 
, programs that are powerful and useful. Includes 
/ * TSR management * Menu, window, and data 
/, entry routines * BCD = Large arrays, and more. 


Turbo Professional 4.0 is only $99. 
| Call toll-free for credit card orders. 
f 1-800-538-8157 ext. 830 (1-800-672-3470 ext. 830 in CA) 


Satisfaction guaranteed or your money back within 30 days. 


Fast Response Series: 

= T-DebugPLUS 4.0—Symbolic 
run-time debugger for Turbo 4.0, 
only $45. ($90 with source code) 

@ Overlay Manager 4.0—Use over- 
lays and chain in Turbo 4.0, only $45. 
Call for upgrade information. 


‘Turbo Pascal 4.0 is required. 

Owners of TurboPower Utilities w/o 
source may upgrade for $40, w/source, 
$25. Include your serial number. For 
other information call 408-438-8608. 
Shipping & taxes prepaid in US. & 
Canada. Elsewhere add $12 per item. 
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Operating System Review 


Running DOS 
Under UNIX 


You no longer have to abandon your favorite 
DOS applications to tap the power of UNIX. 


Q.. the past five years, the 


number of UNIX applications has been 
growing, so that today you can find 
almost any program you want. At the 
same time, however, people have be- 
come increasingly involved in the 
world of MS-DOS. Today, a person is 
more likely to be able to use Lotus 
1-2-3 or Wordstar than drive a stick- 
shift car. So, telling people to scrap 
their 1-2-3 templates and dBASE pro- 
grams is not the way to get them to 
accept UNIX. 

The ultimate solution is to offer all 
of UNIX’s advantages (a large base 
of utility programs, automatic com- 
munications packages, support of 
more powerful machines, and a mul- 
titasking, multiuser environment) to 
the DOS user; That is, allow users to 
keep running their DOS programs, 
but give them access to UNIX when 
they need it. And, that is exactly what 
this article is about: products that 
allow a UNIX system to support DOS 
users. 

The good news is there are such 
products and they do even more than 
I expected. And the bad news isn’t 
that bad. In fact, the only real bad 
news is that, as I write this article 
these products have just hit the street 
and they have kinks that have to be 
worked out. By the time you read 
this, I expect that most of the prob- 
lems will be a thing of the past. 


Phil Hughes is Vice-President of Spe- 
cialized Systems Consultants Inc. (SSC) 
located in Seattle, Washington. SSC 
markets pocket reference booklets for 
various UNIX systems and utilities, and 
software, and offers training and con- 
sulting to UNIX users. 
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by Phil Hughes 


The Concept 

The concept of running DOS as a 
UNIX task is not particularly difficult 
to grasp or achieve. If you look at it 
from the UNIX perspective, DOS is 
just another program to run within 
the operating system. You schedule 
it when it needs processor time, han- 
dle I/O for it, and wait for it to termi- 
nate. The problem is that DOS ex- 
pects to have complete control of the 
hardware—it expects to handle inter- 
rupts, access the disk drives, and hang 
in a loop waiting for a keystroke from 
the keyboard. It also assumes that 
your keyboard generates scancodes 
rather than standard ASCII charac- 
ters. Scancodes are sent to the CPU 
for all keys, including depressing a 
Shift or Control key. DOS also ex- 
pects to be loaded into the first 640K 
of RAM. 

Most of these problems can be 
solved by making changes to the de- 
vice drivers for DOS; make them talk 
to UNIX, which in turn will talk to the 
actual hardware. This adds software 
overhead but it works. Use of re- 
sources such as the floppy disks re- 
mains a problem because only one 
person at a time can use them. Soft- 
ware locks can be used to prevent 
multiple users from accessing them. 
Finally, loading DOS into the first 640K 
of RAM is not a problem on 80386- 
based machines. They have the capa- 
bility to support virtual machines, 
where each user only sees their part 
of the machine. On 286-based ma- 
chines the only practical solution is 
to allow a DOS user to occupy the 
beginning of RAM. This means that 
only one DOS user can be supported 
(along with multiple UNIX users) on 
286-based machines. 


The Products 

Two distinct products can be com- 
bined with different hardware and 
UNIX versions: Merge 386 (or 286) 
and VP/ix. (Except where a distinc- 
tion is necessary, I will refer to both 
Merge 386 and Merge 286 as just 
Merge.) Merge was developed by Lo- 
cus Computing Corporation. VP/ix 
was developed by Interactive Systems 
Corporation and Phoenix Tech- 
nologies Ltd. The other half of each 
combination consists of whatever fla- 
vor of UNIX is attached. We have four 
different UNIX flavors with two sig- 
nificantly different parentages to deal 
with. The hardest one to explain is 
Xenix. Xenix was developed by Mi- 
crosoft Corporation from AT&T’s UNIX 
Version 7. It contains enhancements 
to make it a more commercially vi- 
able product. Over the years, Xenix 
has been updated to comply with AT&T 
System III and now AT&T System V, 
Release 2, specifications. The Santa 
Cruz Operation (SCO) has developed 
generic ports of Xenix for XT, AT, 
and 386 clones, and markets this pack- 
age to end users. The result is that 
Xenix looks like AT&T System V, but 
when you start attempting to inte- 
grate support of DOS into UNIX it is 
clearly different. 

The remaining three flavors are 
Interactive Systems 386/ix, Microport 
V/AT, and Microport V/386. All three 
of these are directly descended from 
the AT&T System V release of UNIX. 
Both 386/ix and V/386 are based on 
UNIX System V, Release 3, and run 
on 386-based systems. V/AT is based 
on UNIX System V, Release 2, and 
runs on 286-based machines. 

Now, let’s get the DOS related prod- 
ucts paired with the UNIX flavors. 
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Think small in a big way 


When you think multiuser/multitasking, 
think Concurrent™ DOS 386, the bi 
name in small systems from Digital 
Research; architects of the first standard 
operating system for personal compu- 
ters. Now, Concurrent DOS 386 allows 
multiple users to share peripherals, files 
and applications, using serial terminal 
workstations linked by RS-232 cables 

to the system. It's fast, reliable and 
economical. 


The big news today is small systems 


Concurrent DOS 386 meets the increas- 
ing demands placed on small systems by 
supporting multiple DOS programs on 
both the system console and attached 
terminals. You can run popular pro- 
grams such as Lotus” 1-2-3? dBase” Ill, 
WordPerfect® and many more, with full 
math coprocessor support. The system 
runs up to 255 tasks simultaneously, with 
full intertask communications and byte- 
level record, file and device locking. 


For people who hate waiting in line 
Concurrent DOS 386 brings you all the 
remarkable speed and power of the 
Intel? 80386 processor. A prioritized pre- 
emptive scheduler allows task execution 


and intertask communication by several 
users at near full processor speed while 
letting some tasks “interrupt” others 
according to the needs of each user. 


A small system with a big memory 


Concurrent DOS 386 gives you access 
to four gigabytes of linear physical 
memory. lts powerful memory paging 
capability fully supports the Expanded 
Memory Specification with no addi- 
tional hardware or software. 


Menus at a touch 

Now you can create and customize 
menus, while programmable function 
keys let you condense complex com- 
mands to a single keystroke. The file 
manager runs standard operating 
system functions, plus you have an 
on-line help facility, text editor and 
support for DOS-based device drivers. 


Multiuser color graphics 


Now with the introduction of the new- 
est member of the Concurrent DOS 
family, Concurrent DOS 386/Multiuser 
Graphics Edition, your demands for 
high-resolution EGA bit-mapped graph- 


ics in the workstation environment can 


DIGITAL RESEARCH’ 
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be met. Take advantage of advanced 
technology allowing you to run popular 
DOS-based graphics programs on indi- 
vidual workstations as well as on the 
system console without sacrificing system 
performance. Ask us about this exciting 
new version of Concurrent DOS 386. 


All you have to remember 
is Concurrent DOS 386 


Concurrent DOS 386 from Digital 
Research is the name to remember 
when it comes to 386 technology. The 
power and versatility of Concurrent 
DOS 386 are giving a new meaning 
to the word multiuser. 


CALL DIGITAL RESEARCH AT 
1-800-443-4200 AND ASK FOR OUR 
CONCURRENT DOS PROGRAMMER 
INFORMATION KIT. 


CONCURRENT DOS 386: 
SHARING THE SYSTEM AFFORDABLY 


Digital Research and the Digital Research logo are registered 
trademarks, and Concurrent is a trademark of Digital Research Inc. 
Other product names are registered trademarks or trademarks of 
their respective owners. Specifications are subject to change without 
notice. Copyright © 1988, Digital Research Inc. All rights reserved. 


Eco-C88 


C Compiler with 
Cmore Debugger 


Professionals prefer the Eco-C88 C 
_ compiler for ease of use and its power- 
_ ful debugging features. Our “picky 
flag” gives you nine levels of lint-like 
error checking and makes debugging 
easy: 


‘T’m very impressed with the com- 
piler, editor, and debugger. I’ve tried 
quite a few different compilers for the 
PC and have given up on all of the 
others in favor of yours... I’ve gotten 
to the point where I download C code 


from a DEC VAX/VMS system just to 
be able to compile it with the picky 
flag set at 9. It finds lots of things 
VMS totally ignores...” 

JS, Oak Ridge, TN 


_ The Eco-C88 compiler includes: 
© A full-featured C compiler with 4 mem- 
ory models (up to 1 meg of code and 
data) plus most ANSI enhancements. 
¢ Without a doubt, the best error check- 
ing you can get. We catch bugs the 
others miss, making you much more 
productive. 
¢ Cmore is a full-featured source code 
debugger, not some stripped-down 
version. 
© Robust standard library with over 230 
useful (no “fluff’) functions, many of 
which are System V and ANSI compat- 
ible. Full source is available for only 
$25.00 at time of order. 
¢ CED, a fast, full screen, multiple- 
window program editor with on-line 
function help. You can compile, edit, 
and link from within CED. 
* cc and mini-make utilities included that 
simplifies the most complex compiles. 
¢ Users manual with over 150 program 
examples (not fragments) to illustrate 
how to use the library functions. 
¢ Fast compiles producing fast code. 


Our Guarantee: Try the Eco-C88 compiler for 
$99.95. Use it for 30 days and if you are not 


completely satisfied, simply return it for a full 
refund. We are confident that once you’ve 
tried Eco-C88, you’ll never use anything else. 
Call or write today! 


Orders: 1-800-952-0472 
Info: 1-317-255-6476 


Ecosoft Inc. 
6413 N. College Avenue 
Indianapolis, IN 46220 
= ECOSOFT 
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Merge runs with the Microport UNIX 
kernel. Thus, Merge 386 is available 
with Microport V/386 and Merge 286 
is available with Microport V/AT. 
These products are available from 
either Microport or Locus. VP/ix is 
available with either the 386/ix UNIX 
kernel from Interactive Systems or 
with the Xenix kernel from SCO. 


System Requirements 

Each product has virtually the same 
system requirements. The 386-based 
products require an Intel 80386- 
based computer, at least 2 MB of 
RAM, at least a 20-MB hard disk, and 
some kind of video board: EGA, CGA, 
or monochrome. (For Merge 286, sub- 
stitute an Intel 80286 processor in the 
requirements.) These are clearly mini- 
mums. Most of the 20-MB disk space 
is used up by UNIX, so add the amount 
of disk space you actually plan to use 
for your application. For serious use, 
4 MB or more of RAM seems more 
realistic. 

I ran the 386 tests on a MicroSmart, 
16-MHz, 386 system with 1 MB of 
32-bit RAM and 2 MB of 16-bit RAM. 
The system has a 64K cache buffer 
between the 16-bit RAM and the proc- 
essor, so actual access speed is close 
to what you could expect with all 
32-bit RAM. The system also has a 
monochrome video card, parallel port, 
two dumb serial ports, an 8-line Arnet 
SmartPort board, and AMI BIOS. Disks 
used were either an Atasi 3046 (36 
MB, 33-ms access) or Seagate 4096 
(80 MB, 28-ms access). 

The 286 tests were run on a Five- 
Star 8-MHz FS-286 system with 2 MB 
of 16-bit RAM, a monochrome video 
card, a parallel port and two dumb 
serial ports, and TriGem BIOS. The 
disk was an Atasi 3046. 

A word of warning on BIOS com- 
patibility. Merge 286 is “fully sup- 
ported” on an IBM PC/AT with IBM 
BIOS and TeleVideo Telecat 286 with 
TeleVideo BIOS. Although I had no 
problem with the TriGem BIOS, I sug- 
gest you check with Microport be- 
fore you buy Merge 286 if you have 
unconventional BIOS. Compatability 
is less of a problem on the 386-based 
system. 


Installation 

Installation of each package was pretty 
much the same. Basically there are 
three steps: 


1. Install UNIX. 
2. Install the DOS product. 
3. Configure DOS users. 


All of the 386 packages come with 
DOS. Merge 286 does not, so you 
need to make a copy of your existing 


DOS system disk (Version 3.0 or later) 
and install a modified image of DOS. 
This means the 286 installation takes 
a little more work than the 386, but, 
again, there’s not a significant differ- 
ence. My biggest problem was the 
confusion I faced because I was in- 
stalling all different packages, and each 
one had to be installed in a slightly 
different manner. (It reminded me 
of the time I was learning C, writing 
programs in PL/M, and teaching a 
Pascal class.) 

One final note on installation. Al- 
though I may make it sound easy, 
remember that I work for a UNIX 
consulting company and have done 
lots of UNIX installations. If you can 
install UNIX, you can easily install any 
of these DOS support products. Just 
remember, installing UNIX is much 
more complicated than installing a 
typical DOS system, which simply in- 
volves copying the floppy to the hard 
disk and rebooting. If you have never 
installed UNIX before, plan to spend 
at least a couple of hours for reading 
and a couple more hours to perform 
the installation. 


Initial Use 

Once you have everything installed, 
you can log on to UNIX and enter a 
command to get into the DOS envi- 
ronment. The command is DOS for 
the Merge products and vpix for 
VP/ix. Assuming you do this from 
the console, it will look as though you 
are now on a DOS system. Such com- 
mands as dir A: will do exactly as 
expected. To leave the DOS world 
from Merge, you just enter quit. A 
UNIX prompt returns. On VP/ix, de- 
pressing the Altkey and hitting SysReq 
gets you a menu. One choice is quit. 
Selecting it gets you back to UNIX. 

Using the multiple screens feature 
of UNIX allows you to run more than 
one UNIX session from the console. 
This also works for DOS. Control se- 
quences (although slightly different 
on the different products) allow you 
to switch between DOS and UNIX 
screens. The VP/ix menu also allows 
you to create a new UNIX shell with- 
out leaving DOS. 

So far, so good. I would like to point 
out that, at this point, I discovered a 
few minor bugs in each of the sys- 
tems. An example is that in SCO Xenix, 
while running GWBASIC under VP/ 
ix, the interrupt (control-C) and stop 
output (control-S) characters are ig- 
nored. I am working with beta re- 
leases of these products and am sure 
that all the bugs J encountered at this 
time will be corrected (or at least 
replaced with new, fresh bugs) by the 
time you read this article. 

One important deficiency I noted 
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at the time of this writing is the lack 
of support of intelligent communica- 
tions boards. If you plan to use DOS 
at multiple terminals, verify that your 
hardware is supported before you 
purchase a particular UNIX/DOS 
product. 

Terminal support is another con- 
sideration. The system keyboard gen- 
erates scancodes for each key change. 
This includes a scancode when a key 
is depressed and another when a key 
is released. Also, screen capabilities 
of the console video board may be 
unique. The screen is 25 lines by 80 
columns. Most terminals are 24 by 
80. The video board may also support 
a graphics mode. Both TeleVideo and 
Wyse make scancode terminals 
(TeleVideo PCS1 and Wyse 60), but 
this may not solve the support prob- 
lem if you plan on running an applica- 
tion that expects an enhanced graph- 
ics adapter. 


File Access 

Both products allow you to access 
floppy disks, a DOS partition on the 
hard disk, and the UNIX file system 
from DOS. Floppy disks and the DOS 
partition on the hard disk are ac- 
cessed the same way as under DOS. 
Under VP/ix, the DOS partition can 
be shared by multiple DOS users only 
if itis read-only. Merge allows multiple 
users to have read/write access. 

For most applications you will want 
to use the UNIX file system. Both 
products allow access to this file sys- 
tem from DOS in similar manners. 
Access is allowed by drive designator 
letters being mapped to the UNIX file 
system. All products allow multiple 
designators for the same file system; 
these allow you to move around more 
easily and satisfy the requirements 
of some applications programs. UNIX 
file access permissions determine the 
access privileges of these files. 

DOS uses the key sequence of car- 
riage-return then line-feed to designate 
the end of a line in a text file. UNIX 
uses only a line-feed. To allow for this, 
utility programs are provided to per- 
form the conversions. On each sys- 
tem, you will run the appropriate util- 
ity to either add or remove carriage- 
return characters. Additionally, un- 
der Interactive Systems’ VP/ix, there 
is a UNIX utility, Jef, which looks at 
the file and decides which conversion 
to perform. 

One nicety that was only in Interac- 
tive Systems VP/ix (actually, I expect 
it is part of 386/ix) was the way a 
missing floppy disk is handled. Most 
systems display an error message and 
abort the program or produce the 
DOSlike “Abort, Retry, or Ignore” 
message. VP/ix displays the message 
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“diskette not present—please insert, 
then automatically keeps polling the 
floppy drive for the disk. 


Installing DOS Applications 
Once VP/ix or Merge is installed, 
you can permit UNIX users to access 
the DOS capability. You can also in- 
stall DOS applications so they can be 
shared by all users. These con- 
figuration decisions are performed by 
equivalent programs in each environ- 
ment: DOSadmin for Merge and vpixad- 
min under VP/ix. 

In this area, Merge seems to have 
an advantage over VP/ix. Merge of- 
fers a menu program that allows you 
to install DOS applications and config- 
ure them. This includes specifying 
which CONFIG.SYS and AUTOEXEC- 


” 


.BAT files to use when an application 
is invoked. 


Running DOS Applications 
In order to see if DOS applications can 
really be run I tried the following: 


Lotus 1-2-3 

PC-Write 

Microsoft Word 

Alice-The Personal Pascal-demo 
Spybase 

Motorola Data Disk 

IBM AT Advanced Diagnostics 
OrCAD Demonstration Disk 


I started the tests with PC-Write and 
Spybase on each system. They are 
both very screen-oriented and I am 
familiar with their operation. There 
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were no problems on any system from 
the console. They ran properly and 
at an apparently reasonable speed. 
Switching to a UNIX screen and back 
to their screens seemed to work fine. 

The Motorola Data Disk is a data- 
base program and information on Mo- 
torola semiconductor products. It dis- 
plays information in about six differ- 
ent languages and offers various 
search methods. I ran this in two 
ways: directly offthe distribution floppy 
and by loading onto the UNIX parti- 
tion of the hard disk. The only prob- 
lem was that the screen print feature 
didn’t work under VP/ix. This could 
have been caused by a misunderstand- 
ing on my part. 

Microsoft Word was a different case. 
It ran under DOS directly and under 
Merge with no problems. Under In- 
teractive Systems’ VP/ix it worked 
fine, but if you changed to a UNIX 
screen and back to Word, the display 
became messed up. The only way out 
that I found was to kill the vpix proc- 
ess. On SCO VP/ix, as soon as you 
load Word the screen becomes a dis- 
aster. It is clearly a problem of initial- 
izing the display adapter, since the 
UNIX screens also end up sick. A 
reboot is the only way out. Neither I 
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nor the person who brought over 
Word to give it a try are experts on 
its use, however. 

Alice and OrCAD results were the 
opposite of each other. The Alice demo 
seemed to work perfectly with all sys- 


... you can 
execute DOS 
commands from 
the UNIX shell 
and UNIX 
commands from 
the DOS prompt. 


tems, OrCAD worked with none of 
them, even though it worked cor- 
rectly under DOS. The problem had 
to do with the video adapter. 

These results don’t surprise me. 
The DOS programs were written to 
run under DOS on specific hardware. 
Each of them seems to have made 


m 


some assumptions. At first, the fact 
that PC-Write worked surprised me. 
It is very screen intensive. Then I 
remembered that Bob Wallace, the 
author of PC-Write, used to work for 
Microsoft. I think his inside knowl- 
edge of MS-DOS has allowed him to 
create a portable product. 


DOS/UNIX Interaction 

There are three reasons you might 
want to run DOS under UNIX. The 
first is if you want to continue to use 
DOS but also want the advantages of 
a multiuser system. The second is if 
you primarily want to run UNIX but 
have an occasional need to run a DOS 
program. Third is if you plan to use 
both DOS and UNIX and want to com- 
municate between the two—in other 
words, you want to get the best of 
each, all in one package. All of the 
products supported communication 
between DOS and UNIX much better 
than I had expected. 

With some limitations, you can exe- 
cute DOS commands from the UNIX 
shell and UNIX commands from the 
DOS prompt. This means that you can 
write UNIX shell scripts that include 
DOS commands, embed DOS com- 
mands in make files, connect com- 
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mands (UNIX and DOS) with UNIX 
pipes, and even use the UNIX at com- 
mand to run DOS batch programs at 
a particular time. This powerful capa- 
bility offers a great chance to merge 
the capabilities of DOS and UNIX, and 
facilitates an easy transition path from 
DOS to UNIX. 


Documentation 

I found the Merge documentation to 
be the best. This is probably because 
the product has been in the field for 
the longest time. It consists of about 
20 pages of Release Notes, a 100-page 
Administrator’s Manual, and a 200- 
page User’s Manual. It is well done 
and accurate. All of this documentation 
is in addition to the UNIX documenta- 
tion. Note that it doesn’t matter 
whether you purchase the product 
from Locus or Microport. The docu- 
mentation is the same, only the color 
of the binders is different. 

The Interactive Systems VP/ix docu- 
mentation seems to be well done and 
complete but I found it harder to use. 
This could be due to the fact that I 
worked with the Merge documenta- 
tion first. There is less documenta- 
tion on VP/ix than on Merge but 
what is there is well written. My ma- 
jor complaint is the lack of sufficient 
user documentation on UNIX itself. I 
expect this indicates that Interactive 
sees its market as “canned” multiuser 
DOS systems rather than UNIX run- 
ning DOS. 

SCO VP/ix documentation comes 
in third, mainly because it lacks an 
extensive user’s guide. What is pro- 
vided is well written, and the fact that 
I have a “Controlled Release” is the 
reason. I am sure the manual will 
be up to snuff with the Merge doc- 
umentation when the full release is 
completed. 


Technical Support 
I used only the Microport and SCO 
technical support. I have had a lot of 
experience with SCO tech support and 
know many people who have worked 
with Microport tech support. The con- 
clusions seem consistent. 

SCO is the bigger company, and 
you will spend a lot of time talking to 
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you do get assistance, they will be 
helpful and walk you through a prob- 
lem. Just be sure to have a book to 
read or something else to do before 
you call. I give them a grade of B. 
Microport support is spotty. When 
you call you may get a busy signal, 
someone who doesn’t know what you 
are talking about, or an expert. You 
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will spend less time trying to find the 
answer but the answer you get could 
be excellent or relatively worthless. 
Recently this support seems to be 
getting better. I give them a B as well. 


Okay, Which Should I Buy? 
Which system do you choose? That’s 
a tough one. If you already are run- 
ning UNIX or Xenix on a 286- or 386- 
based machine, the answer is easy: 
Buy the one that works with what you 
have. If you are shopping for a com- 
plete package, I would let the selec- 
tion of the operating system deter- 
mine the decision. Picking the right 
operating system, in turn, depends 
on the intended use of the system. 
This decision could easily be debated 
in another article. 


Both Merge and VP/ix work under 
Xenix and 386/ix. Performance-wise, 
Merge seems faster with such things 
as keyboard input, and the user inter- 
face seems smoother. The other ad- 
vantage today is that Merge has been 
in the field for a longer time so more 
is known about it. On the other hand, 
SCO has a larger installed base and 
can therefore afford to put more ef- 
fort into the product. Six months from 
now there could be a clear difference, 
but I expect exactly the opposite. I 
expect the products to become more 
alike as the market expands and 
matures. 

A final note concerning Merge 286. 
If you have a DOS system at home 
and want to get your feet wet with 
UNIX, seriously consider this pack- 
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age. It will allow you to continue to 
run your DOS programs and give you 
access to real UNIX system with only 
avery small investment in hardware 
for some additional RAM and possibly 
a larger hard disk. 

On the other hand, if you are com- 
puter shopping, buy a 386 system. 
The difference in cost between 286 
and 386 systems is small, the periph- 
erals are generally interchangeable, 
and the performance improve- 
ment is significant. Then purchase a 
version of UNIX and a DOS support 
product that fits your specific 
requirements. go 
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puter systems have been used in major corpora- 

tions, government, and institutions equipped with 
mainframe and large minicomputers. Tne need to share 
information and computing resources has existed since 
the first computers were developed. The promise of effi- 
cient, affordable multiuser computing systems has been 
touted by major industry players since the announcement 
of the first 16-bit microprocessor. This promise has re- 
mained largely unfulfilled, however, due to the lack of an 
industry-accepted multiuser operating system, although 
many have existed since the mid-1970s. 

The UNIX operating system, released in 1973, opened 
new vistas in portable operating environments. It has 
been adopted by many large manufacturers for use in 
scientific and engineering applications, as well as nar- 
rowly targeted, vertical markets supported by independent 
value-added-resellers (VARs). These manufacturers in- 
clude, most notably, Hewlett-Packard, Digital Equipment, 
AT&T, Honeywell, and Unisys. Even IBM has had UNIX 
running on most of its systems, from the 370 architectures 
to the PC, for several years without much fanfair, and 
without much support. 

Several years ago, Microsoft purchased a UNIX license 
from AT&T and created the first commercial UNIX im- 
plementation for general office/business use on small 
systems. Microsoft named it Xenix and began distributing 
it to OEMs. The Santa Cruz Operation (SCO) joined forces 
with Microsoft to enhance Xenix. SCO adapted Xenix to 
the AT&T SVID (System V Interface Definition) standard 
and announced SCO Xenix System V in 1985. Several other 
developers have adapted UNIX for the Intel-based sys- 
tems, but SCO has garnered the lion’s share of the micro- 
computer UNIX market. 

The promise of affordable multiuser computer systems 
can now be fulfilled using UNIX as the operating system. 
With the right combination of hardware peripherals, UNIX 
installed on a 286/386-based system will efficiently sup- 
port up to 32 users. The important phrase is “right combi- 
nation of hardware peripherals.” An 80386, 20-MHz-based 
computer with three or four terminals employing stan- 
dard asynchronous communications ports will grind to a 


S ince the mid-1960s, time-shared, multiuser com- 
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halt with just a few user processes executing. That same 
computer with an intelligent serial communications sub- 
system can handle 32 terminals running simultaneous 
processes with little or no degradation. There are other 
considerations and system parameters that must be dis- 
cussed to qualify this assertion. These include hard-disk 
access, memory size and speed, caching for both system 
memory and disk I/O, as well as the nature of the applica- 
tions being executed. But these are topics for discussion 
in later articles. 

I elected to discuss the number of terminals and proc- 
esses as a factor in CPU performance because time- 
sharing multiuser systems were developed to support 
interactive terminal (tty) devices. UNIX is no exception. 
UNIX was originally developed by programmers for pro- 
grammers. The intense keyboard activity of programming 
is a critical factor in system performance. In fact, much 
of the UNIX design philosophy hinges around character 
processing. It has the highest priority relative to inter- 
rupts and commands significant system resources. For 
large character input/output functions, like word process- 
ing and text editing, this is significant. 

This article will deal with UNIX character processing and 
how it.can be enhanced to improve multiuser performance. 


How UNIX Handles Character I/O 

Character I/O is initiated by a user process requesting 
either that the UNIX kernel read or write one or more 
characters from a particular character device. For this 
discussion, this device will be a standard ASCII terminal. 
Most other character devices provide similar character 
type interfaces. Two types of requests are recognized for 
character I/O: write characters from the user process to 
the device, or read characters from the device to the user 
process. When either type of request is made to the 
kernel, a set of programs are invoked by the kernel to 
perform these tasks. These processes, known as “device 
drivers,” perform all of the functions necessary for the 
requested task. 

The UNIX kernel provides a set of internal buffers in 
support of character routines known as character lists or 
clists. Each clist is a linked list of blocks of characters 
known as cblocks. Device drivers for interactive terminals 
use clists extensively. A structure is declared for each 
terminal device available on the system. This structure 
contains headers for three clists to be used by the device 
driver for that device. The first clist is the “raw” queue, 
which holds the characters exactly as they are passed 
from the terminal to the computer in 8-bit format. The 
second clist is the “cooked” or “canonical” queue, which 
holds characters from the raw queue that have been 
processed by the “line-discipline” routines. The line- 
discipline routines process the characters requiring spe- 
cial handling for UNIX, such as rubouts, delete line, ex- 
panding tabs to spaces, inserting carriage returns with 
line feeds, and other processing as defined in the ¢ty 
structure. (The line-discipline routines will be discussed 
in more detail later.) The third clist is the “output” queue 
that holds characters to be sent to the terminal. 

Upon receiving a request for character processing from 
a user process, the kernel invokes the appropriate set of 
drivers for the specific device requested. The device 
driver routines perform five basic functions; open, close, 
read, write, and ioctl (I/O control). On the call to the 
device driver, the device is opened by the driver. If the 
request is for a write to the device, the kernel invokes the 
write routine which copies the requested characters from 
the user process space into the output clists for the device. 
The driver then begins the output process, which is 
continued by the interrupt handler within the driver. The 
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interrupt handler processes characters to the device one 
character at a time until the output queue is empty. If the 
number of characters to be output to the device is larger 
than the output clist can accommodate, the write driver 
manages the copying process to the output clist using a 
high-water, low-water scheme—the write routine fills the 
clist to the high-water mark and then goes to sleep while 
the interrupt handler empties the queue. When the low- 
water mark is reached, the write process is awakened to 
refill the queue. This continues until all of the requested 
characters are transferred to the queue. 

If the request is for a read from the device, the kernel 
invokes the read routine initiating the read process. Char- 
acters are input from the device by the interrupt handler 
into the raw queue and immediately transferred to the 
output queue as they are received so that there appears 
to be immediate echo to the screen. As characters are 
received into the raw queue, the read process transfers 
the characters to the cooked queue, processing all of the 
special characters as it makes the transfer. Upon receipt 
of a new-line character (typically a carriage return), the 
read process passes the characters to the user process 
for processing. This reduces the user process overhead. 
This mode of processing is known as “cooked” mode. 
The user process can request “raw” data, that is, receipt 
of each character as it is received without special process- 
ing. In this “raw” mode, the read process does not wait 
for the new-line character to awaken the user process 
with data, but rather passes characters to the user based 
upon either timed input or minimum packet sizing. 

Line-disciplines are processes that are typically stan- 
dard within the kernel. These routines process the special 
characters during the input/output process. In fact, the 
read and write routines in the device driver actually call 
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line-discipline routines to pass characters from the user 
space to the output queue, and from the raw queue to the 
cooked and output queue. 

The line-discipline routines provide both input and 
output processing. Input processing provides for erase 
and line-kill character processing. When the erase charac- 
ter is received, the line-discipline removes both that char- 
acter and the previous character. Echo can be disabled 
for raw input/output through the line-discipline, and out- 
put echo can be selected so that an erase character will 
echo backspace/space/backspace to the terminal. New-line/ 
carriage-return mapping, and lower-to-upper-case transla- 
tion are handled. Eight-bit characters can be stripped to 
7-bit for standard ASCII character processing, and tab 
characters can be expanded to spaces. 

Two very important functions are performed by the 
device driver and the line-discipline. One function man- 
ages software flow control, XON/XOFF processing, and 
the other function manages hardware control of modem 
lines or printers. Software flow control provides the termi- 
nal user with XON/XOFF and functions to stop output and 
restart it as needed. XON/XOFF control is also used by 
terminal firmware for intelligent terminals to control the 
flow of characters to the terminal from the CPU. This 
prevents loss of characters. The modem control protects 
the system security by either logging the user off when 
he hangs up, or hanging up the user when he logs off. 
Either case could prevent potential system security and/ 
or access problems. 

All of the device driver and line-discipline capabilities 
are maintained in a set of parameters in the tty structure 
created for each terminal device. These capabilities are 
available to the user through sfty system calls. The stty 
system calls, in turn, cause the kernel to invoke the device 
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driver’s ioctl process. Joctl processes provide the program 
interface to the device to modify such device parameters 
as baud rate, data bits, start and stop bits, and other 
processing done by the device. 

In raw-mode processing, the line-disciplines are still 
used but the tty structure is changed to prevent any special 
processing during the transfer. There are some obvious 
opportunities to optimize character processing in the raw 
mode by bypassing the line-disciplines during transfers 
(which is discussed later). 

The major drawback affecting system performance is 
that each time the device has a character available to be 
read by the device driver interrupt routine, or each time 
that the write or read routine places a character into the 
output queue, a high-priority hardware interrupt is gener- 
ated. With each interrupt, the CPU stops processing the 
current task, saves the system context, invokes the appro- 
priate interrupt routine, processes the interrupt, rein- 
states the previous machine state, and continues process- 
ing. If terminal I/O is in cooked mode, the interrupt 
handler also provides character “cooking,” tying up the 
CPU even longer. If the I/O device has multiple serial 
inputs that activate a single interrupt, then the interrupt 
routine also must poll the devices to find the interrupting 
device that takes more time. If the programmer who 
writes the device driver does not understand all of this 
(or doesn’t care about other system processes), then he 
could well decide to optimize his driver by reading multi- 
ple characters at a time from the device by putting his 
interrupt handler to sleep, or in a loop-waiting for a 
relatively slow device, making the whole system I/O- 
bound by his device. In these cases, even the system clock 
suffers since it has a lower interrupt priority than the 
terminal I/O and it begins to lose time. Since the device 
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drivers provided by the UNIX vendors and OEMs were 
written by UNIX programmers, these programming mis- 
takes tend not to happen. But the interrupt overhead on 
the CPU cannot be avoided if the I/O device is a typical 
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interrupt control. 
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degrade the system to the point where it becomes unre- 
sponsive and unusable. 

Multiport “dumb” serial boards also do little to improve 
throughput. These boards typically multiplex multiple 
ports onto a single memory-mapped I/O port and share 
the interrupt of that port. The CPU character handling 
overhead is not reduced. In some cases, the device drivers 
for these boards poll each separate port on the board to 
see which port caused the interrupt. In the process of 
polling, the device drivers block out lower priority inter- 
rupts to the CPU for an extended period of time. If the 
clock interrupt is at a lower priority than the serial I/O, 
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Using An Intelligent Serial I/O Controller 

To avoid severe system degradation with increased num- 
bers of users, the first area to try to improve system 
performance and throughput is with intelligent serial 
input/output boards. These boards offload processing 
from the CPU and can make dramatic improvements in 
terminal I/O processing. 

Intelligent multiport I/O boards contain buffers, typi- 
cally of the ring-buffer type, which continually fill and 
empty at the same time. As characters are entered at the 
terminal keyboard, they are buffered in the board’s mem- 
ory and do not necessarily interrupt the CPU. When 
characters begin filling the ring-buffers, the on-board 
processor collects the characters and interrupts the main 
CPU. The characters are typically collected in another 
area of the board’s memory for transmission to the main 
CPU. The board’s processor transfers chunks of data, 
sometimes complete cblocks at a time, to the internal clist 
structure of the appropriate tty. These transfers take place 
under control of the device driver interrupt handler for 
the intelligent device. The read process then invokes the 
line-discipline to process and echo the characters, as in 
the case of the dumb board. As the intelligent board 
processor is transferring characters to the main CPU, 
characters continue to fill the ring-buffer for processing. 
Because keyboard input is typically sent in bursts, differ- 
ent amounts of data will be transferred in a single inter- 
rupt. In each case, the ring-buffer will continue to fill, 
independent of the transfer block size or frequency. 

The relative efficiencies of the intelligent board depend 
upon several characteristics of its implementation. It is 
possible to improve the efficiency of the data transfer 
between the kernel and the device by implementing dual- 
ported memory that both the kernel and the device can 
access. In this manner, the CPU can read characters 
directly from the board memory as the input clist and 
save one level of overhead. The same is also true in the 
case of the output clist. For purists, this is really the 
character control block, not the clist buffers. 

Flow control to the terminal will be more efficient if the 
size of the ring-buffer is larger. Further, efficiencies can 
be obtained through optimization of the software execut- 
ing on the intelligent board. The efficiency of the ring- 
buffer algorithm, the handling of the double buffering and 
interrupts by the board’s processor, dynamic allocation 
of available ring-buffer memory to each port, and auto- 
matic character echoing (with the necessary processing) 
can differentiate one board from another in overall through- 
put and CPU overhead. For example, it is reasonable to 
assume that one device, such as a modem at 1200 baud, 
would not need as much memory for ring buffering as 
would a terminal operating as 38,400 bits per second 
(bps). Additional functions, such as flow control (both 
XON/XOFF and RTS/CTS), can be allocated to processing 
on the intelligent board to improve efficiencies even further. 

Intelligent boards can suffer some of the same pitfalls 
that dumb boards experience if the ports on the board are 
polled for input while interrupts are disabled. Since most 
of the intelligent boards support more ports than most dumb 
multiport boards, this problem could be exacerbated. 

The obvious advantage of the intelligent board handling 
of buffered characters for interrupt processing is to re- 
duce CPU overhead in handling interrupts. Such buffered 
multiport boards can offer a 30-percent to 50-percent 
reduction in CPU overhead and a comparable improve- 
ment in overall throughput. However, as the number of 
active terminals increases, the CPU overhead continues 
to climb until the same symptoms appear as in the dumb 
boards. This is a result of the character-at-a-time process- 
ing taking place in the device driver and the line-discipline 
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processing special characters and transferring characters 
between queues as required. This internal processing of 
characters is the dominant factor in the overhead of 
terminal I/O processing. An intelligent board that could 
offload this character processing could provide dramatic 
improvements in reducing CPU overhead and increase 
overall terminal throughput. 

Acomplete, intelligent I/O coprocessor subsystem that 
provides the foregoing character buffering mechanism, 
as well as character processing of the line-discipline, 
would make great strides toward the realization of the 
32-user 80386-based UNIX system. This type of subsystem 
would provide tab expansion to spaces, end-of-line map- 
ping, character delays as necessary, and case mapping 
provided on output by the UNIX line-discipline without 
any overhead from the main CPU. It could also perform 
the input conversions and character handling for erase, 
line kill, signals, and timeout/packet size processing re- 
quired for raw mode. The main CPU would only process 
characters as raw characters. It would only have to initial- 
ize the user process I/O request and handle the kernel 
portion of the device driver for data transfer between user 
memory and the I/O board’s memory. 

The need to use the kernel’s internal buffering clist 
mechanism could be eliminated if the subsystem and 
driver software were implemented efficiently. Rather than 
transferring data into a clist for processing, the device 
driver would transfer data directly into dual-ported mem- 
ory on the I/O subsystem board for processing. This 
would be raw data, and all required processing would 
take place on the subsystem board. Input data would 
likewise be processed on the subsystem board and the 
interrupt handler would simply copy the cooked data from 
the board’s dual-ported memory to user memory space 
so it is ready to be used by the user process. Processing 
by the main CPU would be memory-to-memory, similar 
to a caching mechanism, and the overhead would be 
dramatically reduced. 

Since this processing is relatively short in duration, the 
addition of active terminals would mean only a relatively 
small incremental increase in CPU overhead per terminal. 
This would have little effect on executing processes and 
overall system degradation would be minimal. 

Offloading the line-discipline and buffer processing to 
an intelligent subsystem is not without its problems. As 
in the case of the CPU’s character processing for dumb 
and intelligent boards, overhead on the coprocessor serv- 
icing the I/O subsystem will increase dramatically as the 
number of active terminals increase. This could cause 
overall throughput to suffer at the I/O subsystem. Al- 
though system degradation would be minimized, charac- 
ter throughput could choke the system and the symptoms 
would be very much like the cases mentioned with dumb 
and intelligent boards. To truly optimize system perform- 
ance, the intelligent subsystem must support the classic 
line-discipline functions differently than UNIX. The broad 
objective of developing the UNIX operating system was 
portability, which often meant trading off efficiency and 
eloquence. 

Some of the line-discipline functions could be optimized 
by modularizing these functions into a set of standard 
processes. These processes would represent normal in- 
put and output configurations and would not process 
little-used functions like upper-to-lower-case mapping and 
character delays for slow terminals. A typical output func- 
tion would be tab expansions to spaces and new-line/ 
carriage-return mapping. This would optimize output proc- 
essing, which accounts for 90 percent of the I/O function, 
and would drastically reduce the coprocessor overhead. 
Similar processing modules could be developed for input 
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processing and for character echo. 

Raw character processing also could be optimized by 
simply bypassing the line-discipline routines for special 
character processing and streamlining the flow of charac- 
ters from the asynchronous port to the buffered memory. 
Character timeout and minimum packet size processing 
could be handled by the subsystem and raw mode. I/O 
would be efficient and very low in coprocessor overhead. 

In addition to optimizing system performance by offload- 
ing and optimizing line-discipline functions, the intelligent 
I/O subsystem could further improve performance using 
dual-ported memory through which the device driver 
transfers large amounts of data, either to or from user 
memory, for a single-user I/O request. With sufficient 
memory on the intelligent board, a single screen refresh 
could be handled through a single request and a transfer 
of the screen from user memory to the subsystem mem- 
ory for immediate processing, all on a single interrupt 
with no character processing overhead. 

It is easy to see why manufacturers of complete intelli- 
gent I/O subsystems claim dramatic improvements in 
overall character throughput and reductions in CPU over- 
head. In some cases, intelligent subsystem manufacturers 
have achieved overall throughput in raw mode of approxi- 
mately 80,000 baud with 8 active 9600 bps terminals 
compared to approximately 40,000 baud for intelligent 
boards and 20,000 baud for dumb boards. These same 
subsystem manufacturers claim cooked-mode CPU over- 
head of less than 20 percent for 8 terminals compared to 
90 percent to 100 percent for intelligent and dumb boards. 

One of the obvious disadvantages to the intelligent I/O 
subsysiem is the need to develop independent firmware 
(the software installed on the board) with each implemen- 
tation and modification to the operating system. This 
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would not be nearly so prevalent with intelligent boards 
that do nominal processing in addition to character buffer- 
ing. This problem effects board manufacturers, but could 
also have an impact on system users since upgrading the 
operating system to the latest release could mean upgrad- 
ing the EPROMs and drivers for the subsystem board as 
well. It could mean taking down the system and in ex- 
treme cases could mean shipping the board to the manu- 
facturer for an upgrade. 

Some manufacturers have overcome this potential prob- 
lem by providing their firmware with the installable driv- 
ers, and having the initialization portion of the drivers 
download the firmware at boot time. The memory on the 
subsystem board serves as normal system memory as 
well as buffers for character I/O. This reduces the soft- 
ware upgrade problem for the user. More manufacturers 
should follow this lead, even for intelligent boards. 

The last advantage of intelligent I/O boards, both intel- 
ligent and subsystem boards, is their ability to add fea- 
tures to the I/O processing not available in dumb boards. 
The main feature available in intelligent coprocessor- 
based I/O boards is transparent print—the ability to con- 
nect a serial or parallel printer to the back of an ASCII 
terminal without installing two cables and using two ports 
on the I/O board. This feature allows two data streams, 
one for the terminal and one for the printer, to be multi- 
plexed over a single set of cables to the terminal. The 
printer is connected to a separate or auxiliary port on the 
back of the terminal. A special character code sequence 
(escape sequence or control code) is sent to the terminal 
to shut off the screen output and redirect output to the 
auxiliary port. A second sequence returns output to the 
terminal screen. 

The newer ASCII terminals are microprocessor-based 
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with many added features. These terminals typically have 
large character buffers. Printers, on the other hand, have 
relatively small buffers because they process those char- 
acters more slowly through the print head mechanism. 
If properly tuned, this provides the opportunity to multi- 
plex the two data steams so that the printer can be 
employed at maximum capacity without any apparent 
interruption in terminal processing. Small bursts of data 
can be sent to fill the printer buffer and be processed by 
the printhead mechanism while large bursts of data would 
be sent to the terminal buffer. By manipulating the size 
of these two data steams, the printer and terminal will 
appear to operate simultaneously without hesitation. 

Additional features are available through intelligent 
coprocessor serial boards. Multisession support can be 
implemented whereby the terminal user can log into 
multiple sessions on separate virtual screens on the termi- 
nal. Some boards implement this function using the mullti- 
page memory inherent in newer terminals, and some 
boards implement this feature on the intelligent board 
through use of terminal memory mapping on the board 
and virtual terminal algorithms. Function key mapping 
and function key programmable macros also are features 
that have been implemented on some intelligent boards 
as a result of the power of the coprocessor. No doubt we 
have just begun to see the expansion of features that will 
appear when the power of the intelligent peripheral be- 
comes the norm for multiuser systems. 


The UNIX Login Process 

The line-discipline process handles character processing 
or “cooking” for the device driver. This cooking of each 
individual character is a function of the requirements of 
the UNIX command-level syntax and the requirements of 


the applications program. Character processing is per- 
formed according to values set in a software structure for 
each terminal line, called the termio structure. This struc- 
ture establishes: first, whether characters will be proc- 
essed in a cooked mode or raw mode, and then what type 
of additional processing will occur. Options for input 
character processing include signal handling for interrupt 
and quit signals, erase character and erase-line charac- 
ters, flow control characters XON(ASCII dc3) and 
XOFF(ASCII del), new-line/carriage-return processing, 
and upper-to-lower-case mapping. Output processing op- 
tions are also selected in the termio tty structure and 
include new-line/carriage-return mapping, lower-to-upper- 
case mapping, expansion of tabs to spaces, and necessary 
delays for new lines, carriage returns, and form feeds for 
slower hard-copy-type terminals. Line characteristics for 
each terminal also are contained in the #ty structure, 
including baud rate, data bits, stop bits, parity and special 
hardware signal processing for data terminal ready, car- 
rier detect, and request-to-send signals. The last section 
of the tty structure determines how the line discipline will 
interact with the terminal driver. From this section the 
line discipline gets information describing whether to 
process signals, whether input will be cooked or raw, and 
how to handle character echoing to the terminal. 

The tty structure is read by a program called “getty.” 
Getty initializes the terminal line and opens it for process- 
ing. Initial settings for the tty structure can be established 
through the use of a system file called GETTYDEFS. 
GETTYDEFS contains a series of mnemonics that describe 
settings in the tty structure. When getty is called to open 
a terminal line, it is passed the line name, or device name, 
and an index into the GETTYDEFS file. Getty reads the 
GETTYDEFS file for that index and sets the #ty structure 
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accordingly. Getty then opens the line, which causes the 
device driver to invoke the line discipline using the tty 
structure set up by getty. The #ty structure can be altered 
as necessary through either an I/O control command to 
the driver, ioctl, or through the stty system command. 
These commands allow the interactive user or the applica- 
tions program to alter the character processing as neces- 
sary after the line is open. 

Cooked character processing by the line-discipline means 
that characters are passed to the requesting process on 
a line-by-line basis. That is, regardless of the number of 
characters requested, the device driver passes characters 
to the requesting process after a new-line or end-of-file 
character has been received. End-of-line in the UNIX file 
system is indicated by the new-line (line feed) character 
and end-of-file is indicated by an EOT (control D) charac- 
ter. If raw input processing is selected then characters are 
passed to the requesting process as received by the device 
driver. In raw mode, it is also possible to accept bursts of 
characters that should be processed together, such as 
escape sequences and programmed keys like special 
function keys, by setting the time flag in the fty structure. 
This flag causes the driver to wait for a specified period 


In UNIX remote terminal 
connections, 

the critical signals 

are between the DCE 
and the DTE. 


of time before passing characters to the requester. When 
the time has expired, the driver passes all characters 
received during that time to the requesting process. This 
allows small groups of characters to be processed together. 

Getty opens the line with initial parameters, typically 
parameters capable of accepting characters from most 
generic terminals, and issues a login prompt on the line. 
Upon acceptance of the login entry, getty establishes the 
final character processing parameters from GETTYDEFS 
and invokes the login process. As long as a line is to be 
used as an active terminal, all character processing can 
be established through the GETTYDEFS file and custom 
login parameters can be established after login. If the line 
is to be used for other than an active terminal, such as for 
a printer, getty should not be invoked for that line to avoid 
a login message. GETTYDEFS will not affect the character 
processing. In this case, a default set of tty parameters 
will be established for the line by the system developers 
supplying the UNIX system. If these parameters are not 
acceptable for a given device, for example a slower 2400- 
baud printer or a printer that can only provide XON/XOFF 
flow control and has a small character buffer, the parame- 
ters will need to be changed through the stty system 
command. This can be accomplished through the printer 
spooler interface to a specific device, or through an inter- 
face script or program, or through the system initializa- 
tion file that executes at boot time. 

Remote access to a UNIX system via modems presents 
additional requirements for serial I/O processing. Mo- 


44 Micro/Systems 


dems for dial-up lines typically operate at speeds slower 
than normal terminals. Parameters for the tty structure 
must be changed to process these characters. In addition, 
security becomes more important since there is no physi- 
cal control available as with local terminals. It is necessary 
to ensure that, when a user dials in to the system, the 
user has permission to process. When an authorized user 
exits the system, the line must be cleared for the next 
user access, and if the line drops off or hangs up before 
the user logs off, that user must be automatically logged 
off and the line cleared. All of this activity can be con- 
trolled through the same #ty structure that controls char- 
acter cooking. 


Handling DTE/DCE Serial I/O 

There are two types of devices that interface through the 
RS-232C protocol. The equipment that provides the inter- 
face to the source of information is known as the Data 
Terminal Processing Equipment or Data Terminal Equip- 
ment (DTE), and the equipment that provides the signal 
conversion to the communications link is the Data Com- 
munications Equipment (DCE). Each type of equipment 
provides a different part of the RS-232C interface. Data is 
transmitted from the transmit line of the DTE to the 
transmit line of the DCE through the communications line. 
It is then received by the receive line of the distant DCE 
and retransmitted to the receive line of the distant DTE. 
This takes place under one of two types of data flow 
control. The first is software control provided in the data 
stream and recognized by the character processing at 
both ends of the link, such as XON/XOFF or an ACK/NACK 
type of control. The second is hardware flow control as 
provided for in the RS-232C standard. This flow control 
is typically handled by two lines called request-to-send 
(RTS) and clear-to-send (CTS). The DTE typically activates 
RTS when it is ready to send data, and the DCE activates 
CTS when it is ready to receive data. Hardware flow 
control is usually not necessary unless the rate of data 
flow is sufficiently fast and the buffer size of the connected 
DTE and/or DCE is so small that it causes data to be lost 
during transmission. Typically software flow control such 
as XON/XOFF will suffice, but the delay in transmission, 
processing, and turnaround of the communications line 
could make software control too slow to be effective. For 
most modem connections, software flow control will be 
more than sufficient. 

The critical signals in UNIX remote terminal connec- 
tions are not the flow control lines but rather the initial 
connection signals between the DCE (modem) and the 
DTE (computer serial I/O port). These signals are called 
data terminal ready (DTR) on the serial port and carrier 
detect (CD) on the modem. The DTR is activated when 
the terminal is ready to receive data. In UNIX, the DTR 
line is activated by the device driver under a set of possible 
conditions. The first is that the tty structure indicates that 
the device connected to the line is a remote device. This 
flag in the tty structure is the LOCAL flag. If CLOCAL 
appears in the GETTYDEFS file, or if clocal is entered 
through the stty system call, the driver assumes that the 
device is local and DTR, CD, RTS, and CTS processing are 
ignored. If CLOCAL does not appear in GETTYDEFS, and 
the supplier of the operating system did not assume that 
the default line was a local line, or if the -clocal flag is used 
with the stty system call, then DTR, CD, RTS, and CTS 
signal processing are enabled. With processing enabled, 
upon opening of the line the DTR signal is activated, telling 
the DCE that the DTE is ready for processing. Typically, 
modem control will prevent the modem from answering 
a call if the DTR line is not active. (This is usually a setting 
on the modem and should be selected during setup.) If 
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the device driver for the serial port supports modem 
control, then the driver will await a CD signal from the 
modem before getty issues a login to the remote device 
through the modem. This prevents unauthorized access 
to the system if the port is not enabled since DTR is not 
active until the port is opened. The DTR signal can also 
be used to cause the modem to hang up the line, or go 
“on-hook” if the DTR line is dropped. 

The other critical line from the modem to the serial 
port is the carrier detect line. When the modem answers 
an incoming call (goes “off-hook”), it sends the distant 
modem a carrier tone. When the distant modem receives 
carrier, it returns carrier to the local modem. Upon receipt 
of the carrier, the local modem handshakes with the 
distant modem, then activates the CD line to the serial 
port. There is another ¢ty structure flag that plays an 
important role in remote terminal processing. The HUPCL 
or hang-up-on-close flag causes the DTR line to be deacti- 
vated upon close of the last process associated with the 
line. The combination of the not CLOCAL and the HUPCL 
flags provides security for remote terminals, plus it can 
provide ease of terminal support for local terminals. 

Use of the CD and DTR lines for remote terminals 
provides two critical functions for system security. The 
first is control of access to the modem. If the line with the 
modem is not open or enabled, the modem will not answer 
the phone line. If the distant terminal logs off or closes all 
processes on the terminal, HUPCL causes the DTR line to 
go inactive to the modem and the modem will hang up, 
which prevents line charges in the case that the distant 
terminal failed to hang up the modem. It also makes the 
line available to other remote users. In addition, if the 
distant terminal hangs up the modem without logging off 
or if line quality or interruption causes the modem to 
hang up prematurely, the loss of CD causes a hang-up 
signal to be sent to all processes attached to the terminal 
line logging off the remote terminal. This causes getty to 
recycle for a new login, and resets the modem line for the 
next remote access. 

These two signals can also be used for local terminal 
control. If the same ¢ty structure parameters are set to a 
local terminal, and control signals are implemented by 
running cable connections to the Tx, Rx, DTR, CD, etc., 
pins in the normal implementation of RS-232C on a DB-25 
connector, then turning off the power on the terminal has 
a similar effect to modem control. If the terminal hangs 
or is non-responsive to the CPU, then powering off the 
terminal drops the CD line to the serial port which drops 
the line, the hang-up signal is sent to the terminal proc- 
esses, and a new getty is spawned. This will usually clear 
non-responsive terminals and close all files and processes 
connected to that terminal without corrupting files or 
databases. Typically, the RTS and CTS lines of the DB-25, 
RS-232C connection should also be run or at least looped 
back upon themselves to maintain continuity and guaran- 
tee system performance. 


Summing Up 

Serial character input and output is the heart and soul of 
a multiuser system. If implemented correctly and with 
appropriate intelligent controller support, 286- and 386- 
based UNIX systems can support up to 32 terminals and 
beyond for specific applications. The market for cost- 
effective computing is growing rapidly, and technology 
is available to take advantage of that growing base. Under- 
standing serial I/O is an essential first step to taking full 
advantage of this market growth. o 
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hen confronted with both simple and com- 

plex problems, most programmers will go 

to a high-level language such as C, FORTRAN, 
Pascal, or BASIC. These languages offer portability, short 
development time, easily modifiable program modules, 
and can provide satisfactory program performance. High- 
level languages, however, do not provide the most com- 
pact machine code or the fastest execution time. When 
such a program has unacceptable performance, you can 
usually determine where the program spends most of its 
time and re-code these portions in assembly language. 

Many applications demand better performance than 
can be achieved with a mixture of low-level and high-level 
language programming. When maximum speed and/or 
minimum code size are essential, the only programming 
language that fits the bill is one that brings the program- 
mer as close to the CPU architecture as possible— 
assembly language. 

What types of applications require assembly-level source 
programming? Many powerful operating systems are pro- 
grammed in assembly language to minimize size and 
enhance speed. The highest performance software pack- 
ages available on the market today were written partially 
or entirely in assembly language. In many “real-time” 
processing systems (signal processing, process control, 
data collection and recording, etc.), assembly language 
is the only choice because the system must be synchro- 
nized to some external clock via the microprocessor’s 
interrupt mechanism while heavily taxing the machine’s 
capabilities. For example, a real-time communications 
signal processing system design I am currently involved 
with uses five VME-bus, 16-MHz, 68010 processors run- 
ning in parallel at nearly 100-percent capacity with assem- 
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bly language software. Using a compiled language would 
require at least one more CPU, and this additional proces- 
sor would increase the cost and reduce the reliability of 
the system. 

Use of assembly language alone, however, will not 
guarantee maximum performance. To achieve highest 
speed and/or smallest code size, assembly language pro- 
grammers must carefully consider and exploit the in- 
struction set and CPU architecture. Additionally, the pro- 
grammer must consider algorithm design. For example, 
a bubble-sort written in assembler would probably run 
more slowly than a quick-sort written in C. 

This article discusses details, techniques, and consid- 
erations for optimizing the speed and size of 8086/8088 
assembly language programs. I will conclude with a set 
of simple rules to follow when optimizing code (see the 
box). These rules and techniques can also be applied to 
some high-level languages and other microprocessors. I 
will assume the reader has a basic understanding of the 
8086 instruction set and architecture. Those who need a 
refresher should review The 8086 Family User's Manual 
and the references listed at the end of this article. All 
performance considerations and rules discussed in this 
article apply to both the 8086 and 8088 CPUs, except 
where otherwise noted. I will not discuss algorithm de- 
sign; countless authors have devoted books and articles 
on the subject for sorting, searching, floating point math, 
random number generation, etc. 


Speed versus Size—The Ultimate Tradeoff 

Contrary to what you might expect, a program written for 
minimum execution time is larger than a similar program 
optimized for size. This means that, for any given applica- 
tion, you must decide which is more important, compact 


code or high speed. 

In most situations, it is easy to decide when to trade off 
size for speed. Multiple decision-making and repetitive 
operations, such as sorting and searching algorithms, 
should sacrifice code size for speed. Interrupt and fast 
peripheral service routines should also be written for 
maximum speed. In controller applications where ROM 
space is at a premium, compact code is more important 
than speed. It is common to find engineers squeezing the 
last few bytes of code out of a program so that one or two 
ROMs may be eliminated from the target hardware. 


Counting Clock Cycles and Bytes 

The only way to squeeze the last ounce of performance 
out of your machine is to keep track of the execution time 
and code size while you write the code. That’s not to say 
you must calculate these numbers for every instruction 
in your program. Instead you should understand the 
general characteristics of the instruction set. We will 
review the instruction set and derive a few simple rules 
for program optimization. 

Table 1 shows typical performance characteristics for 
most data transfer, arithmetic, and logic instructions. The 
“execution time” column specifies the number of CPU 
clock cycles required to perform the instruction. For 
example, if an 8086 runs at 8 MHz (8 million clock cycles 
per second), then the instruction ADD AX,5 would take 0.5 
microseconds (4 clock cycles) to execute. 

It can be seen that instructions are most compact and 
execute faster when the operands are registers. This 
brings us to the first rule of Performance Programming: 
To save time when multiple iterations or operations are 
to be performed on a piece of data, keep that data in a 
register(s) as long as possible (as shown in Examples 1 
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and 2). When used in moderation, this rule can save 
significant amounts of time. Notice Example 1 is 20 per- 
cent faster than Example 2, but it is also four bytes longer. 
Unfortunately, the 8086 does not contain many regis- 
ters. Therefore, you can get into trouble when simultane- 
ously manipulating a lot of data because you end up 
spending more time swapping registers and memory 
locations than processing data. In these cases, it is best 
to keep only the most often used operands in registers. 


Memory Operands and String Operations 

Table 1 illustrates that execution time of memory operand 
instructions depends primarily on the number of memory 
transfers that occur, the addressing mode used, the data 
bus width (CPU type), and the operand alignment. 

All the logic and arithmetic instructions (except for 
TEST and CMP) require two memory transfers if the 
destination operand is in memory. Consider the following 
statement: 

ADD SUM, AX 

In this example, the CPU must fetch SUM from memory, 
add the contents of AX to it, and then store the result in 
memory location SUM. Total execution time is 22 clock 
cycles. If the two operands are swapped so that AX is the 
destination, the total execution time is reduced to 15 
clocks. This is because SUM is only read and not written. 
Therefore consider this corollary: When memory vari- 
ables are used in arithmetic and logic operations, try to 
use them only as source operands as much as possible. 

Remember that displacements used in base and in- 
dexed addressing modes can be either 16-bit or signed 
8bit numbers. If data can be placed in a 256-byte (or 
smaller) block, you can cut code size considerably by 
setting an index (or base) register to point to the middle 
of the block and using indexed or base offset addressing 
to access the data. This causes the assembler to generate 
byte offsets instead of word offsets for the opcodes. 

Example 3 shows a one-byte saving over direct ad- 
dressing because three bytes are required to initialize the 
index register. This is a small savings in code, but it may 
prevent an additional ROM requirement. If a large number 
of data accesses can be executed before the index register 
is modified, the savings will be substantial. 


Example 1 


Example 2 


BL, SUM ADD SUM, AL 
BL,AL NEG SUM 
BL . 

SUM, BL 


12 bytes/35 clocks 8 bytes/44 clocks 


Example 3 


SI, OFFSET OPC 
DX, [SI] 

DX, OPD-OPC[SI] 
DX, OPA-OPC[SI] 
OPB-OPC [SI] ,DX 


: Point to middle of data 
; Access OPC 
7 Access OPD 
; Access OPA 
; Access OPB 


14 bytes 
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On 8088 machines, 16-bit memory operand instructions 
require four more clock cycles per data transfer than the 
execution times shown in Table 1 (assuming no memory 
wait states) because the CPU’s external data path is only 
8 bits wide. 

On 8086 machines (not 8088 machines), memory oper- 
and alignment has a direct impact on program speed. 
Whenever the CPU must access a 16-bit memory operand 
located at an odd address, it must split the operation into 
two 8bit accesses. This adds four clock cycles to each 
memory access. Instructions that must access the oper- 
and twice (NOT, NEG, etc.) will take eight clock cycles 
longer to execute. This performance degradation can 
become quite appreciable during string operations. For 
example, an 8MHz 8086 executing a 32K word string 
MOV operation will suffer a performance degradation of 
33 ms (almost 50 percent) if the source and destination 
operands are not word-aligned. 

This does not mean that byte string operations will 
execute as fast as non-aligned word string operations of 
equivalent length. Using our previous example, we find 


that performance is degraded by an additional 37 ms if a 
64K move is performed instead of a non-aligned 32K word 
move. Therefore, if speed is of utmost importance and 
large amounts of data are to be moved (or stored), the 
operation should be performed with a MOVSW (STOSW) 
instead of a MOVSB (STOSB) instruction. 

If the string length is always an even number of bytes, 
then converting the string length from a word count to a 
byte count can be done by simply shifting it right one bit. 
If there is either an even or odd number of bytes in the 
destination string, then a routine similar to the one shown 
in Example 4 can be used to convert a byte string move 
to a word string move. 

For short string move operations, using multiple MOVS 
instructions instead of the REP MOVS instruction will 
reduce execution time and can reduce code size by a few 
bytes. Two different ways to move a four-word string are 
shown in Examples 5 and 6. 

An added benefit illustrated in Example 5 is that the 
CX register remains unaffected by the string operation. 
Concerning execution time, the “break even” point for 


Table 1. 8086/8088 Data Manipulation Instruction Performance 


Instruction Operands Execution Time Transfers 


Length 


ADD, ADC 
SUB, SBB 
AND, OR, XOR 


REG, REG 
REG, IMM 
ACC, IMM 
REG, MEM 
MEM, REG 
MEM, IMM 


REG, REG 
REG, IMM 


10-77 
118-133 


(80-98) 
(128-154) 
(76-83)+EA ((86-104) +EA) 1 
(124-139)+EA ((134-160)+EA) 1 
DIV (IDIV} 80-90 (101-112) = 
144-162 (165-184) 
(86-96)+EA ((107-118)+EA) 
(150-168)+EA ((171-190)+EA) 


RCL, RCR 8+4/bit 
ROL, ROR 15+EA 
SAR 20+EA+4/bit 


INC, DEC 


String Instructions 


LODS (STOS) - 12(11) 1 1 
repeated - 9+13/rep (9+10/rep) l/rep 2 
CMPS - 22 2 1 
repeated 9+22/rep 2/rep 2 
SCAS = 15 1 1 
repeated - 9+15/rep l/rep 2 


MOVS - 18 2 1 
repeated - 9+17/rep 2/rep 2 
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Miscellaneous Instructions 


16+EA (17+EA) 


PUSHF (POPF) = 10 (8) 1 1 


LAHF, SAHF oy 4 oe 1 


CBW (CHD) 


Memory Operand Effective Address 
Calculation Time (BA) 

Addressing Mode Clock Cycles 
(BX], [BP], [SI], or [DI] 
Displacement only 
(BP+DI] and [BX+SI] 
[BP+SI] and [BX+4DI] 
dispt+([(BX], [BP], [SI], or [DI]) 
disp+([BP+DI] or [(BX+SI]) 
disp+((BP+SI] or [BX+DI]) 


Note: Execution times assume instruction has already been 
fetched and no memory wait states are required. Add 4 clock 
cycles for each memory word transfer on 8088 machines and 
each odd addressed memory word transfer on 8086 machines. 
Add 2 clock cycles for each segment override prefix. 
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using the REP prefix is for strings that are 13 elements 
long. So, for the fastest possible code, use multiple MOVS 
instructions for string moves of 12 elements or less. 

The above discussion also applies to the STOS and 
LODS instructions, although I am not sure how much 
value there is in repeatedly using the LODS instruction 
without some additional code. 

By the way, string instructions that start with CX=0 will 
not execute! This prevents the machine from operating 
on a full segment of data (65,536 bytes) with a single 
byte-string instruction. The only way around this limita- 
tion is to perform the operation with CX=65535 and then 
execute an additional string instruction. 


Multiplication and Division 
Think twice before using the multiply and divide features 
of the 8086 hardware. If you are multiplying or dividing 
numbers by a power of two, shifting left or right by the 
appropriate number of bits can make performance up to 
70 times faster than using the MUL and DIV instructions. 
What if you are multiplying by numbers that are not a 
power of two? You should still consider using a multiply 
routine instead of the MUL instruction if one of the num- 
bers you are working with is a constant value. For exam- 
ple, suppose the number y in the AL register must be 
multiplied by 10. Two ways to perform this multiplication 
are shown in Examples 7 and 8. Example 7 is nearly 11 


Example 4 


MOV CX, BYTE_COUNT 
MOV DI,OFFSET DESTINATION 
MOV SI, OFFSET SOURCE 
SER CX,1 
REP MOVSW 
INC NO_ADD 
MOVSB 
NO_ADD: . 


; Convert byte count to word 
count and move the data. 

: Move 1 more byte if BYTE COUNT 
was odd before the shift. 


Example 5 Example 6 


MOVSW MOV CX, 04 
MOVSW REP MOVSW 
MOVSW 
MOVSW 


4 bytes/72 clocks 5 bytes/81 clocks 


Example 7 Example 8 


1 pAX=2y MOV DX, 10 
AX ;save 2y in DX MUL AX,DX 
1 jAX=4y 

1 7AX=8y 

DX =; AX=8y+2y=10y 


SHL AX, 
MOV DX, 
SHL , 
SHL ’ 
ADD , 


11 clocks/10 bytes 118 clocks/5 bytes 


Example 9 (fast) Example 10 (compact) 


SHIFT_IT MACRO : 
REPT 4 - 
SHL AX,1 MOV CX,4 
RCL DX,1 SHIFT_IT: SHL AX,1 
ENDM RCL Dx,1 
ENDM LOOP SHIFT_IT 

; Actual code is below: ‘ 


SHIFT_IT 


16 bytes/32 clocks 9 bytes/76 clocks 
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times faster than Example 8, but also requires twice as 
much code. 

If the application requires code size to be minimized, 
the single word AAD instruction can be used to multiply 
AH by 10 and add the product to AL. AAM can be used to 
divide AL by 10. 


Program Transfers 

The program transfer instructions allow for compact cod- 
ing (CALL/RET, LOOP, etc.) and program decision making 
(JZ, JGE, etc.). They also take a lot of time to execute. 
Table 2 gives the performance characteristics for these 
instructions. 

If coding for speed is the objective, then subroutines 
and unconditional jumps should be avoided. Although it 
makes sense to place an often-used algorithm in a subrou- 
tine, the overhead of doing so will add 17 to 67 clock cycles 
each time the subroutine is called. To avoid subroutines, 
duplicate the desired code wherever it is needed. You can 
do this by simply copying the routine, but a better way is 
to use a macro. (A macro is a single program line that 
represents one or several instructions. When it is ex- 
panded during assembly, the assembler replaces the macro 
call with the group of instructions it represents.) Macros 
make the program easier to read and take up less room 
(in the source file) than if the routine is duplicated every- 
where it is required. 

This rule applies just as well to looping and other 
iterative operations. Examples 9 and 10 illustrate the case 
of shifting a 20-bit number in Dx:AxX left by 1 nibble. Here 
the time savings is 58 percent. 

You can modify this technique by executing several 
operations during each loop iteration if the program must 
perform a large number of operations and still remain 
reasonably compact. The result is fewer program jumps 
and faster execution. In Examples 11 and 12, a table of 
1000 data points is normalized by multiplying each point 
by a variable power of 2 (contained in CL). CL=4 in these 
examples. 

By doubling code size, we reduce execution time by 
about 21 percent. If the “brute force” technique of Exam- 
ple 9 was used, execution time would be reduced to 45,004 
clock cycles but the code size would be 4003 bytes. 

When writing code for minimum size, use subroutines 
wherever possible and avoid using code macros. How- 
ever, don’t forget to consider the code “overhead” (CALL 
& RET instruction size) required when deciding which 
portions of code to locate in a subroutine. In most cases, 
an instruction sequence that is at least 5 bytes long and 
appears more than once in the program should be made 
into a subroutine. 

There is a trick you can use to save even more space 


Table 2. Program transfer instruction performance 
Instruction Mode Execution Time Length 
CALL register indirect 16 2 
direct NEAR (FAR) 19(28) 3(5) 
memory indirect NEAR (FAR) 21+EA (37+EA) 2-4 
RET no pop NEAR (FAR) 8(18) 1 
pop NEAR (FAR) 12(17) 3 
oMP register indirect ll 2 
direct NEAR (FAR) 15 2-3 (5) 
memory indirect NEAR (FAR} 18+EA (24+EA) 2-4 
Conditional jump short - relative 16 - jump taken 2 
4 - jump not taken 
LOOP short ~ relative 17 - jump taken 2 
5 - jump not taken 
LOOPE (LOOPNE) short - relative 18(19) - jump taken 2 
6({5)-jump not taken 
INT type 3 52 1 
not type 3 51 2 


IRET = 24 1 
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when subroutines call other subroutines. If the last in- 
struction of a subroutine (excluding the RET instruction) Example 11. “Normal” approach 
is a call to another subroutine, then replace that CALL 
instruction (and the RET instruction that follows) with a 


JMP to the called subroutine. This technique works be- Dx, 1000 ; 1000 data points. 

cause the first subroutine uses the second subroutine’s ee 
RET instruction to return to the original calling program. SI 

The net result is that a byte or two of code is saved and cl ee 
execution time is typically reduced by 27 clock cycles. a ; 
You must ensure, however, that the calling subroutine 62,996) clock cycles 


13 bytes 


and called subroutine are the same type (NEAR or FAR). 
Otherwise, the return address will be the wrong length 
when it is popped off the stack during execution of the 
last subroutine’s RET instruction. 


If program execution depends on the result of a subrou- Example 12. Faster Loop 
tine called by the main program, consider setting the flags : 
in the subroutine (TEST or CMP the applicable parameter) MOV SI,OFFSET TBL_ST ; SI is the data pointer. 
* ° . . . MOV DX, 1000/4 ; 250 iterations (4 data 
just prior to its RET statement. This allows all calling + “pointe teachspaae)’- 
programs to use a conditional jump instruction without Sa 
testing the condition of the applicable variable, which can rage fei ee srs eens 
save quite a bit of code if the subroutine is called several SHL WORD PTR [SI],CL  ; Normalize a data point. 
times Inc. SI ; Next data point, 
* SI 

WORD PTR [SI],CL ; Normalize a data point. 
Decisions, Decisions! rt 1 Mext data point. 
Conditional jumps are found wherever a program must WORD PTR [SI],CL  ; Normalize a data point 
make a decision. Although these instructions cannot be 7 Next data point. 
entirely eliminated, we can usually optimize decision- piapolnte Navel beannerealls= 
making to hold program jumps to a minimum. In many ey pees joe 
cases, this can be done by using the data that the program ; ae Tas 
execution path depends upon as a pointer to the routine 25 bytes 


to be executed. 

For example, suppose we are writing a command dis- 
patcher that must select 1 of 10 routines according to the 
ASCII digit contained in AL. Two ways to do this are shown 


in Examples 13 and 14. Example 13 is not only bigger and Example 14 

slower, but also suffers from addressing range restric- ; 

tions imposed by conditional instructions; each routine o 

(ONE, TWO, etc.) must be within +129 to —126 bytes of its cMP AL, : 

respective jump instruction since conditional jumps con- a SUB AL, #Convert data to 

tain only a one-byte displacement relative to the location : > = C cipes @ tr, 

of the conditional jump. We could use an intermediate he caw 

jump table to get around this restriction, but this would . pas 

add 30 bytes of code and increase the execution time by sno gMP TABLE[DI] ; Execute routine 
TABLE DW ZERO, ONE, TWO, THREE, FOUR 


15 clock cycles. Example 14 allows the command routines - ca Five cel eee Te ae 
to be anywhere in the 64K program segment without ERROR: a eS 
requiring an intermediate jump table. 

What if the allowable values in AL are a set of random 
ASCII capital letters instead of consecutive numbers? We 


20-80 (50 ave) clocks 


40 bytes 42 clocks/35 bytes 
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AdvanTech 1-800-338-3130 
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can still use Example 14 with only a slight modification —; Compact Leading Zero Counter 


as shown in Example 15. This routine uses much more MOV CX, 16 ; AX=0 => 367 clocks 
memory than Example 14, but it is still faster than Exam- CHKBIT: SHL AX, 1 ; AX>7FFFH => 29 clocks 
ple 13. Jc FOUND 
If we need to use subroutines instead of jumps, Example LOOP CHKBIT 
13 becomes very inefficient: FOUND: SUB CL, 16 ;Return count 
NEG CX 7 inch. 
CMP AL, "0" Although this is a tight piece of code that looks fast, a 
JNE NOTO 
CALL ZERO 
JMP DONE Example 15 
NOTO: CMP AL, "1" . neers 
SUB AL, "A" ; Remove ASC. ias 
JNE NOT1 CMP OAL, "T"-"A™ ; Highest character allowed 
CALL ONE JA ERROR H is letter "T". 
CBW ; Convert the data toa 
JME DONE SHL AX,1 ; routine pointer. 
NOT1: . MOV DI,AX 
UMP TABLE[DI] 
. ERROR: 
ERROR: CALL NONE ‘ 
DONE: : ; Command routine addresses, Legal chars. are ’ACDGHJPRST’. 


TABLE DW RA, ERROR, RC,RD, ERROR, ERROR, RG, RA, ERROR, RJ 
DW 5 DUP (ERROR) 
DW = RP, ERROR, RR, RS, RT 


clocks/ es 

We could improve it by putting the return address on Sete ee: 
the stack and jumping to the subroutine instead of calling 
it: 


Example 16 
; "FAST" Leading Zero Counter 
MOV CL,8 7At least 8 leading zeros 
MOV BX, OFFSET DONE OR AH,AH 3 if most significant 
PUSH BX v2 GT8 ; byte is 0. 
no" XOR CL,CL 
CMP AL, "0 cy aa 
JE ZERO : CMP AL,8 3>4 zeros in LS byte? 
wae JB GT4 
CMP AL, "1 CMP AL,208 ;I£ not, >2 zeros? 
JE ONE JB GT2 
CMP AL,40B ;I£ not, >1 zero? 
JB INC2 :I£ >1, must be 2 leading 0s. 
. CMP AL,80H ;If not, then there are 
DONE: JAE INCO 3 no leading Os or... 
. as UMP SHORT INC] ; only 1 leading zero in LSB. 
: CMP AL,10H ;If >2, then there are either 
JAE INC3 ; 3 leading Os or... 
. . . . C3 i he B. 
_ However, we still have the same jump distance restric- - Ge ee ae oe, ara tare 8 varest 
tion as in Example 13, plus we pick up 15 extra clock 3B GTT 
cycles of execution time and 4 bytes of code. The code in a ok. - Secieeccs: 
Examples 14 and 15 can be used directly by replacing JMP UMP SHORT INC6 ; 6 leading Os in the LSB. 
TABLE[DI] with CALL TABLE[DI]. This adds only three = ‘oe 
clock cycles to the execution time without adding any INC ; 8 leading 0s in the LSB. 
bytes of program code. : # 
The technique discussed above is effective for selecting : Ine 
a single execution path from many choices. When the . uiseineamacenierictl 
° s e : 3 * OC. 
number of possible outcomes is less than eight, or one ee a-tecie 


outcome has a much higher probability of occurring than 
the others, the direct approach shown in Example 13 is 
the fastest and most compact. 

Another example of decision-making that vividly illus- 
trates the relationship between code size and execution 


speed follows. This routine counts the number of leading sciiinianinie 
zeros in the piece of data contained in the AX register; #PEASTEST" (and largest) Leadtog (bere counter 
such a routine might be found in data scaling and floating 1U_TBLE SEGMENT PARA 


. TABLE DB 16,15,14,14, 4 DUP (13) 
point math programs. DB 8 DUP (12), 16 DUP (11), 32 DUP (10) 


DB 64 DUP (9), 128 DUP (8), 256 DUP (7) 
DB 512 DUP (6), 1024 DUP (5), 2048 DUP (4) 
DB 4096 DUP (3), 8192 DUP (2), 16384 DUP (1) 
DB 32768 DUP (0) 

LU_TBLE —-ENDS 


Table 3. Speed compared to program size 


| average | 
relative speed | size (bytes) | execution time | CODE SEGMENT BYTE PUBLIC 


ASSUME DS:DATA, CS:CODE 
! 

slow & compact | 14 

faster & bigger | 56 

fastest & biggest | 65545 


BX,LU_TBLE ;Use data as a pointer 
DS, BX : into a leading zero 
SI,AX : lookup table. 

CL, (ST] A 
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jump is required for every leading zero. Assuming equal 
probability of any number of leading zeros, the average 
execution time of this program is 191 clock cycles. 

If we use a binary-search type of algorithm, we find we 
only need to make four decisions (program jumps) at 
most (24 = 16 bits). An example of such a routine is shown 
in Example 16. Again assuming equal probability of any 
number of leading zeros, this example is more than twice 
as fast, on average, but the code is four times as large. 

Finally, if speed is of utmost concern and size is not a 
consideration, we can use the “brute force” approach 
shown in Example 17. The amount of memory used by 
this example may seem to make it a ridiculous approach, 
but it is the fastest and can be readily justified when 
minimum execution time is required. After all, memory 
is cheap! 

Although this is a specific case, the leading zero counter 
program is a good example of how speed and size can be 
traded off when a multi-outcome decision must be made. 
The performance characteristics of the three examples 
are shown in Table 3 for comparison. 

In most decision matrices, execution time varies ac- 
cording to the value of the input data. Where the data falls 
within a specific range most of the time, the code should 
be written to minimize the number of program jumps 
executed when the data is within the “expected” range. 


Miscellaneous Tricks and Techniques 

When using register operands, there are several common 
tasks that can be accomplished in a few different ways. 
With a little thought, these tasks can be streamlined to 


cut clock cycles. 

XORing a register with itself is one clock cycle faster, 
and one byte shorter (for 16-bit registers), than MOVing 
0 into it. If you need to clear several registers and/or 
memory locations, clear a register and then MOV it into 
the other operands. 

How about testing a register for 0? Normally you do 
this by comparing the register with 0. ORing a register 
with itself accomplishes the same objective while saving 
one clock cycle and a byte of code. If you need to test a 
memory operand for 0 and have a cleared register, CMPing 
the register with memory is one clock faster and saves a 
byte over directly comparing the memory operand with 0. 

Don’t forget that the stack is really memory. If you 
need to discard some values off the stack, manipulate the 
stack pointer (ADD SP2) in lieu of POPing numbers off 
the stack. For one discarded value, this method will save 
four clocks; for two values, 12 clock cycles will be saved. 
In situations where code size is restricted and only one 
or two values must be discarded, use POP instructions. 

If you need to add or subtract 2 to or from a 16bit 
register, use two INC (DEC) instructions instead of an 
ADD (SUB) instruction. The execution time is the same 
but you will save a byte of code. 


Prefetch Queue Operation 

Unlike previous generation microprocessors (Z80, 8080, 
6800, etc.), the 8086 family CPU architecture supports 
instruction pipelining. This is made possible by the use 
of a Bus Interface Unit (BIU) and an Execution Unit (EU) 
in the CPU that operate separately and relatively independ- 


Performance Programming 
Rules of Thumb 


To save execution time: 


1 
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Keep often-used data and often- 
manipulated data in registers 
for as long as possible. 
Minimize the use of memory 
variables as destination oper- 
ands in instructions to save 
seven clock cycles per instruc- 
tion. 

On 8086 systems, align all 16- 
and 32-bit memory operands on 
even address boundaries. 
Convert byte-string MOVE and 
STORE operations to word- 
string operations. 

Use individual MOVS (STOS) in- 
structions instead of a REP’ed 
string instruction for strings of 
14 elements or less. 

Use the shift and rotate instruc- 
tions for multiplication and divi- 
sion (instead of MUL and DIV) 
when one of the operands is a 
constant. 

Use macros instead of subrou- 
tines. 

Eliminate unconditional jump in- 
structions in looping routines 
by replacing each required itera- 
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tion with a macro. When a large 
number of iterations is required, 
perform multiple operations in 
each loop iteration. 

Adjust SP (instead of using the 
POP instruction) to discard val- 
ues off the stack. 


. To save a few extra clock cy- 


cles, intermix fast and slow exe- 
cuting instructions when possi- 
ble. 


To save code size: 


i 


2. 


Group often-used memory vari- 
ables into a 256-byte block and 
use indexed (offset) address- 
ing mode to access the data. 
Use AAM and AAD instructions 
when dividing and multiplying 
by 10. 

Avoid macros. Replace all sec- 
tions of code that appear more 
than once in the program (and 
longer than 4 bytes) with a sub- 
routine. 

Use the SHORT assembler direc- 
tive with the JMP instruction 
when forward referenced labels 
are close (within 129 bytes) to 


the JMP instruction. 

Use individual MOVS (STOS) in- 
structions instead of a REP’ed 
string instruction for strings of 
four elements or less. 


To save execution time and minimize 
program size: 


1. 


Minimize forward references by 
placing all data and constant 
declarations at the beginning 
of the source code. 

In large programs, distribute 
memory variables among the 
smallest number of 64K seg- 
ments possible. Set the segment 
registers to take advantage of 
the default segments used by 
the instructions that access the 
data. 

Command dispatchers and sim- 
ple-decision matrices should use 
table pointers (derived from the 
input data) instead of several 
conditional jump instructions 
when the number of possible 
outcomes is eight or more and 
all outcomes have a fairly equal 
probability of occurrence. 
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ently of each other. The BIU fetches instructions and data 
from memory while the EU executes the instructions 
fetched by the BIU. Combined with an internal prefetch 
queue, this architecture enhances program speed by al- 
lowing instruction fetches and instruction execution to 
occur in parallel. 

During normal operation, the BIU fetches instructions 
from memory and stores them in a 6-byte Instruction 
Prefetch queue (4-byte on the 8088) while the EU exe- 
cutes these instructions. When the EU needs a memory 
operand to execute an instruction, e.g., MOV AL,[BX] [DI], 
the bus interface unit fetches the required memory oper- 
and. Under ideal conditions, the BIU will maintain at least 
one unexecuted instruction in the prefetch queue while 
responding to all EU-generated memory access requests. 
This causes the CPU to run at maximum efficiency; the 
EU continuously executes instructions and the CPU bus 
is busy most of the time. 


Any instructions 
that alter program 
flow will reduce 
CPU throughput 
because after 

the program 
transfer occurs, all 


instructions are 
flushed from the 
prefetch queue. 


Any instructions that alter program flow (prevent in-line 
code execution) will reduce CPU throughput (instructions 
executed per second) because after the program transfer 
occurs, all instructions are flushed from the prefetch 
queue. After the Instruction Pointer is loaded with a new 
value, the EU must wait for the BIU to fetch the next 
instruction before the EU can continue program execu- 
tion. This is why the control transfer instructions take so 
long to execute. If program jumps occur too often, the BIU 
wastes a lot of time fetching instructions that are never 
executed. Therefore, to enhance program speed, mini- 
mize use of software interrupts, subroutines, and program 
jumps as discussed earlier. 

If the program contains many fast executing instruc- 
tions in a row, the EU will “empty” the prefetch queue 
faster than the BIU can “fill” it. This will result in short 
periods of time where the EU is idle while waiting for the 
BIU to fetch instructions. In 8086 systems with no wait- 
state, 16-bit memory, this problem occurs if consecutive 
instructions execute faster than two clock cycles per 
opcode byte because one instruction fetch cycle (16-bit 
memory read) takes four clock cycles to execute. The 
situation is aggravated further in systems that are already 
bus bandwidth limited much of the time; computers that 
use large amounts of slow memory (one or more wait 
states) and/or dynamic RAM fall into this category. 
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development fools: The answer a a B deninite 
=YES:* 


Locate C Bugs 
BEFORE they Bite with 


PC-lint 


PC-lint will analyze your C programs (one or many 
modules) and uncover glitches, bugs, quirks, and 
inconsistencies. It will catch subtle errors before they 
catch you. By examining multiple modules, PC-lint 
enjoys a perspective your compiler does not have. 


PC-lint is full K&R plus ANSI extensions. It will find 
argument/parameter mismatches, inconsistent 
declarations, uninitialized variables, unaccessed 
variables, suspicious macros, unreachable code, 
indentation irregularities, function inconsistencies, 
unusual expressions, printf-scanf format 
irregularities, and much, much more. All error 
messages can be turned off locally and globally. 


. aremarkable well thought-out product which 
will check for just about every conceivable : 


é coding error... Its value increases with frequent 
use ... we confidently recommend PC-lint."| 


Andrew Binstock, The C Gazette 


“PC-lint has everything going for it: flexibility, 


price. ‘Texercised the product daily ona large, . 
working, project to see if! could include it in my : 


Seen D. Cpa i Blue Notes . 
San Francisco PC Users Group 


.. friendliness i is a prime consideration i in 


: Pelint Gimpel has provided ways to make Lint : 
shut up about all those errors you either Know 
-ordon’tcare about. lfUnix-Lintwas sts 
_implemented as ‘well, | would use it more.’ es 


Don Malpass, IEEE Sofiae” 


NOW AVAILABLE -- Generic Lint in shrouded 
source form for use on VAX/VMS, Unix, Xenix, 
Microport, Versados, IBM VM/CMS - MVS, 
OS-9, etc. Requires only K&R C to compile. 
Prices start at $798. Call for details. 


2 
Gimpel Software 
3207 Hogarth Lane 
Collegeville PA 19426 
(215)584-4261 


PRICE: $139.00 first copy, $100 each additional 
VISA, MC,COD - PA residents add 6% sales tax 
30 Day Money-back Guarantee -- Specify MS-DOS or OS/2 
Works with any MS-DOS C compiler - direct support for 12 
C compilers including Microsoft, Turbo, Lattice, Desmet 
PC-lint and Generic Lint are trademarks of Gimpel Software. 


CIRCLE 75 ON READER SERVICE CARD. 


Micro/SysteMS 55 


The bottom line here is that actual execution time may 
differ from calculated execution time by several percent. 
The amount of deviation depends on the number of control- 
transfer instructions and the sequence of instructions in 
the program. 

Example 18 illustrates this point. It calculates the aver- 
age of two 40-bit numbers located in SOURCE and DEST. 
As shown in Table 4, the actual execution time is 11 
percent greater than the calculated execution time. By 
simply rearranging the program so that the fast (RCR) 
instructions are interleaved with the slower (MOV 
DEST[SI+6],DI, etc.) instructions, the “excess” execution 
time is reduced by 4 percent. The performance improve- 
ment is admittedly small, but it may be significant in 
severely time-constrained applications. 

I am certainly not suggesting you conduct a prefetch 
queue performance analysis whenever you write code. 
Just remember its impact on performance if you need to 
squeeze a few more clock cycles out of your program. 
And if you are writing a routine that must provide a precise 
time delay, you should measure its execution time rather 
than relying on a timing calculation. Section 4.3 of The 
8086 Family User’s Manual contains a sample detailed 
analysis of prefetch queue operation. 


Memory Organization 

and Segment Register Management 

The 8086-family CPU uses segment registers to define 
separate code and data portions (called “segments”) of 
memory. These segments may only be 64K (65,536) bytes 
long because the memory addresses specified in instruc- 
tions and registers are restricted to 16-bit numbers. When- 


Example 18 


CODE SEGMENT PARA PUBLIC 
ASSUME CS:CODE,DS:CODE 


SOURCE DW 
DEST DW 


5 DUP (?) 
5 DUP (?) 


MOV SI, OFFSET SOURCE 

LODSW 

MOV ‘BX, [SI] 

MOV CX, [SI+2] 

MOV DX, [SI+4] 

MOV DI, [SI+6] 

ADD AX, DEST[SI-2] 
BX, DEST [SI] 
CX, DEST (SI+2) 
DX, DEST [S1+4] 
DI, DEST (SI+6) 


; Initialize data pointer, 
; Move SOURCE to registers 
for fastest execution, 


; Sum SOURCE and DEST in 
registers. 


: Divide the sum by two. 


aX, 1 
DEST[SI+6], DI 
DEST[SI+4], DX 
DEST[SI+2], CX 
DEST(SI], BX 
DEST[SI-2], AX 


; Save the results in DEST. 


ENDS 
END START 


Example 19 


Initialize segments Swap source and destination 


string data segments 


MOV AX, OFFSET DATA_SEG MOV AX,ES 
MOV ES,AX MOV BX, DS 
MOV DS,AX MOV DS, AX 

. MOV ES, BX 


7 bytes/8 clocks 8 bytes/8 clocks 
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ever the CPU accesses system memory, it combines the 
16-bit address (specified in the instruction or instruction 
pointer during memory operand and instruction fetches) 
With one of the segment registers to generate a 20-bit 
physical address. This means the Data Segment (DS) and 
Extra Segment (ES) registers must be initialized to point 
to data memory before any memory operations may be 
performed. For DOS .COM files, the operating system 
takes care of this task during loading and the program 
should not normally manipulate the segment registers. 

All DOS .EXE program files require the program to set 
the DS and ES registers to point to data memory. Pro- 
grams that use less than 64K of data memory can get 
away with simply setting these registers once at the 
beginning of the program. 

Memory variables in programs requiring more than 
64K of data must be distributed between two or more 
segments (64K maximum per segment). This is done by 
declaring the variables in different segments (SEGMENT 
and ENDS assembler directives) in the source program. 
The manner in which this is done affects performance 
because a combination of segment register manipulation 
and segment register override techniques are required 
to access the data. 

Segment register operations can become fairly cum- 
bersome because only a limited number of instructions 
(LES, LDS, PUSH, POP, and MOV) can directly manipulate 
the registers. Segment register contents and immediate 
operands cannot be MOVed directly into a segment regis- 
ter. Thus, a few instructions are required just to set a 
couple of segment registers as shown in Example 19. 
While the code may seem trivial, the resulting perform- 
ance degradation becomes significant when the segment 
registers need to be changed frequently to access data in 
different segments. If memory operands are divided into 
the smallest number of segments possible, program speed 
will be improved and program size will be reduced be- 
cause less segment register manipulation will be required. 

Most of the data manipulation instructions and ad- 
dressing modes use the DS register as the default seg- 
ment for calculating the 20-bit physical address. You can 
use the segment override prefix with most instructions 
to change this when convenient. Table 5 illustrates which 
segment registers may be used in the various types of 
memory accesses and addressing modes. 

You should keep segment register usage in mind when 
writing code. For example, although it seems convenient 
to place a program jump table right after the code that 


Table 4. Prefetch queue operation example results 


Calculated | Actual Execution Time (clock cycles) 


execution | Example 17 i Modified example 17 
time | Q wait state | 1 wait state | 0 wait state | 1 wait state 


Table 5. Matching segment registers and accesses 


Memory Access Default Segment Alternate Segment 
register registers* 
Instruction fetch none 
Stack operation none 
String source operand Cs, ES, SS 
String destination operand none 
Memory operand (addressing cs, DS, ES 
modes that use BP) 
Memory operand (all other 
addressing modes) 


Cs, ES, SS 


* Altered by using segment override prefix. 
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Catan o WB oG AR {PT 1’O N 


PAINZY ATA lL OUN 


Here At Last! 


The Authoritative Source 
of Database Application and 
Product Information 


DBMS Magazine offers in-depth 
articles examining the capabilities 
of programmable databases as tools 
for solving real business problems. 
Authoritative features show how 

a specific product is used to solve a 
specific problem. Sample some of 
the feature articles to be presented: 


¢ 50 Nasty dBase Bugs, and How to 
Squash Them 

¢ Has Ashton-Tate Finally Designed 
a Decent, Multiuser Database? 

¢Why 386 Databases Can be Either 
the Fastest, or a Waste of Money 

¢Is FoxBase+ dFastest Around? 

¢Why Microsoft and Ashton-Tate 
teamed up on SQL: Server — What 
this Ambitious Product Will and 
Will Not do for You 

e R:Base Puts OS/2 to Work 


e Understanding Paradox's PAL 


| OAs 
rabase 
{ncreasins PC os H 


¢ Can Oracle Squeeze Into a PC? 


..With regular columns written by 
experts, providing tips on developing 
and modifying databases, such as: 


¢ Putting dBase IV's Arrays to Work 
e Cross tabulations in R:Base 


e Sharing Files Between 1-2-3 and 
dBase 

¢ Using Clipper's Valid Statement 

¢ Tips on Creating Reports in Paradox | 


Who Should 
Read DBMS 
Magazine? 


Professional developers, MIS/DP 
departments, consultants, VARs 
and serious users will find DBMS 
Magazine to be the source of 
important information for serious 
users of microcomputer database 
software. 


oe 


Charter Offer and 
Benefits with 
No-Risk Guarantee 


For a limited time DBMS Magazine 
will offer Charter Subscriptions with 
substantial cash savings. 


You will also be entitled to continued 
savings — on all renewals and on any 
gift subscriptions you may order. 


If at any time you decide DBMS is not 
for you, simply cancel. We will send 
a full refund for all unmailed issues. 


Order Today! 


Send your order form to: 


DBMS Magazine 
P.O. Box 3713 
Escondido, CA 
92025 


Catena nie sot Oeb oC nee or | O oN  O°R Dp EsR = FO REM 
| 
| 


[| Yes, please send my first issue and enter my Charter Subscription to 
DBMS Magazine. My full year's subscription (11 additional issues) is only $19.97. 


I save 44% off the newsstand rate. 
SEND NO MONEY! 


We will bill you later, after you receive your first issue. GUARANTEE! You may 
cancel at any time for any reason and receive a full refund on all unmailed issues. 


MAIL TO: DBMS Magazine, P.O. Box 3713, Escondido, CA 92025 


NAME 


COMPANY 


ADDRESS 


CITY/STATE/ZIP 
Allow 4-6 weeks for delivery of first issue. 


Newsstand rate (lyear): $35.40 


] | don’t like working with 
e others 

PeaceNet is a computer network and communication 
system for people who believe that global planning 
and cooperation are necessary to reverse a trillion-dol- 
lar-per-year arms race; it is linking users throughout 
the United States and in over 70 other countries. 


I’ve got all the information I'll 
eever need 
PeaceNet is for those who appreciate that information 
is always growing and changing; its bulletin boards, 
conferences, and databases provide information about 
everything from Central America to Star Wars. 


3 I love playing phone tag 

@ PeaceNet’s electronic mail system renders those 
endless conversations with secretaries and answering 
machines obsolete. 


I don’t know how to use my 
e computer 


PeaceNet helps novices with simple, entertaining man- 


uals and round-the-clock staff for answering their 
questions. 


5 I enjoy copying: labeling, and 
e stamping letters 

PeaceNet enables you to send messages to hundreds of 
other users with one simple command. 


Peace and anti-nuclear efforts, Central American issues, 
environmental protection, anti-apartheid efforts, human 


I’ve got plenty of money 
eto waste on postage and 
phone bills 
PeaceNet is for people who want to save money; it lets 
you send documents across the world faster than 
Federal Express™ for pennies per page. 


I don’t mind getting action 
ealerts a week late 

PeaceNet does mind and can help your organization 

send out time-urgent alerts instantly. 


| don’t have the right kind of 
e computer equipment 
PeaceNet is available to anyone with a computer termi- 
nal and a modem. 


9 An effective peace movement 
eisn’t worth 50 cents a day 
PeaceNet users disagree. 


10 It’s all hopeless, anyway 
@ Then why read this magazine when Modern 


Wrestling would suffice? 


Use PeaceNet’s electronic mail, conferences, bulletin 
boards, databases, and Telex services to gather, 
organize and share valuable up-to-date and 
comprehensive information about: 


rights, Native American issues, and much more. 


PeaceNet is the largest and most rapidly growing on-line community of progressives anywhere in the world. 
And by using any home computer and a modem it is only a local phone call away. Call or write today for 


more information. 


wv PeaceNet: 


3228 Sacramento Street 
San Francisco, CA 94115 (415) 923-0900 


uses it (in the code segment), the instructions that access 
the table require a segment register override prefix if the 
DS register is not pointing to the code segment. (This is 
the normal case.) This segment override prefix adds a 
byte of code and a couple of clock cycles. If you have used 
the ASSUME assembler directive correctly, this will go 
unnoticed because the assembler will automatically gen- 
erate the segment override prefix for you. The prefix can 
be eliminated (making the code more compact and faster) 
by moving the jump table into a segment that is pointed 
to by the DS register. 

In programs containing many data segments, the mem- 
ory operands should be distributed to take advantage of 
the default segment registers that the various instructions 
use. And, if all data can be contained in two or three 
segments, you can save execution time and minimize 
code size by assigning a segment register to each segment 
and using the segment override prefix as necessary. For 
example, if a spelling checker program uses a 32K string 


Optimizing assembly 
language programs... 
is not difficult, 

but it requires strict 
attention to detail 

and an in-depth 
knowledge of the 
instruction set and 
CPU architecture. 


variable area for text manipulation, a 32K string constant 
area for string comparison operations, and 4K of miscella- 
neous memory variable area, the memory variable areas 
should be placed in one segment and the string constant 
area should be placed in a different segment. The DS and 
ES registers would be set to point to the variable and 
constant data segments respectively. This would mini- 
mize segment register manipulation and segment over- 
ride prefix usage because most data manipulation instruc- 
tions use the DS register while the string comparison 
instructions (CMPS and SCAS) use the ES register. 


Programming Practices 

Despite your best efforts, there are certain occasions 
when the assembler will generate more code than you 
intended. This can be prevented by understanding how 
the assembler operates and by following good program- 
ming practice. 

Most assemblers are two-pass assemblers; they scan 
the code twice before the machine code is completely 
generated. During the first pass, the assembler allocates 
space for the opcodes and data, assigns values to labels 
and constants, etc. The assembler allocates the maximum 
amount of room for instructions that can contain either a 
word or byte displacement (indexed memory operands 
with a displacement and short JMP instructions) if the 
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size of the displacement is undefined due to a forward 
reference (a label or operand definition that appears after 
it is used in an instruction). Additionally, immediate oper- 
ands in 16-bit ADD, ADC, SUB, SBB, and CMP instructions 
are allocated one word of memory if the value of the 
operand is defined after the instruction. (A 16-bit immedi- 
ate operand in the range +127 to -128 can be encoded as 
a byte to reduce code size. The byte is then sign-extended 
to its 16-bit equivalent during program execution.) 

During the second pass, the assembler fills in the 
opcodes with the displacements and immediate operands 
that were assigned upon completion of the first pass. If 
any displacements/immediate data can be reduced from 
16-bit to 8bit numbers, the assembler will modify the 
opcodes of the affected instructions and insert NOP in- 
structions in the unused portions of opcode area allocated 
during the first past. This adds one byte of code and three 
clock cycles per NOP instruction. 

You can avoid this problem by putting all constant 
definitions (EQU statements) and data variable allocation 
directives at the top of the program. Short unconditional 
jumps to forward-referenced locations within 129 bytes 
of the JMP instruction can be forced to assemble in two 
bytes (instead of three) by using the SHORT pseudo-op. 
This causes the assembler to allocate only one byte for 
the displacement during the first pass. (Refer to The 
Macro Assembler Reference Manual for details.) 

To prevent performance degradations due to memory 
operand alignment in 8086 systems, be sure all word 
operands are located at even addresses. Do this by using 
word- or paragraph-aligned data segments (defined in the 
SEGMENT assembler directive) and placing all word-data 
variables at the beginning of the segment. If you must 
place word variables in the middle or end of the data 
segment, group the variables together and use the EVEN 
directive to restore operand alignment. 


Conclusion 

Optimizing assembly language programs for speed and 
size is not difficult, but it requires strict attention to detail 
and an in-depth knowledge of the instruction set and CPU 
architecture. This article should provide some insight into 
methods of exploiting the instruction characteristics and 
architecture of the 8086 family CPU to achieve maximum 
performance. 

Although the performance of the 80286, 80386, V20, 
and V30 CPUs was not covered, all the techniques de- 
scribed still apply. In general, instructions execute in 
fewer clock cycles on these processors than on the 8086 
and 8088. Oo 
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Technical Brief (continued from page 19) 


OS/2 has effectively two command-line interfaces, 
both of which can be replaced like COMMAND.COM. 
One interface is for the DOS compatibility box that 
runs in real mode while the other interface is for 
protected mode. There may be only one of the former, 
the DOS command processor, and any number of the 
latter, the OS/2 command processor, due to the limi- 
tations of OS/2. Multiple command processors (com- 
mand-line interface programs) run concurrently un- 
der OS/2. The two different types of command proc- 
essors can cause a number of problems. The DOS 
command processor is essentially compatible with 
DOS’s COMMAND.COM. 

This allows most applications and batch files to run 
under OS/2. However, there are a number of com- 
mands that are not replicated in the OS/2 command 
processor and an equal number that the OS/2 com- 
mand processor supports that its counterpart does 
not. This can be quite confusing as well as frustrating. 
It is also one reason why the DOS compatibility box 
will be used sparingly. 

One thing that may keep people away from these 
command processors is the menu front end supplied 
with OS/2. This menu system is used primarily for 
starting applications which, for the most part, is what 
the command-line programs do. 

The UNIX command processor, like DOS and OS/2, 
is also replaceable. There are a number of UNIX shells 
(the same terminology is often used with DOS and 
OS/2) that have become popular like the Cshell or 
csh, and the Bourne shell. The UNIX shells perform 
the same functions as the DOS and OS/2 command 
processors: execute programs, built-in command files, 


and batch files. There is no comparable standard to 
the OS/2 menu front end for UNIX. 

There are differences between the UNIX and OS/2 
command processors, such as differences in built-in 
commands and utility programs. The built-in DOS 
directory function, DIR, is functionally identical to the 
UNIX /s program. Batch files have different file exten- 
sions, and the file name sizes are different between 
UNIX and OS/2. 

Both the OS/2 and UNIX command processors 
provide support for the underlying multitasking sys- 
tem by allowing multiple programs to be started at 
the same time as well as linking programs together 
via pipes. Pipes appeared first in UNIX and are found 
in DOS, although DOS only simulates pipes using files 
because DOS cannot run multiple programs simulta- 
neously. An example of a command line using pipes 
would be: 


DIR | FILTER | MORE 


The command is actually syntactically correct for 
both OS/2 and UNIX (provided the three files exist 
on the UNIX system). The command processor starts 
off three programs with the output of one being 
supplied to the standard input file of the next program. 
Programs that process information in this fashion are 
called filters. Combining batch files, pipes, and other 
multitasking features of the command processors is 
often all that is necessary to build or support an 
application. Power users tend to take advantage of 
these features, but most users of OS/2 and UNIX do 
not even know they exist let alone their implications. 


With Prospero PC Pascal 
and PC Fortran you never 
lose your (train... 
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Programming Interface 

OS/2 and UNIX provide a programming interface that 
is very similar but differs from DOS. Functionally, all 
three provide basic operating system functiens includ- 
ing file manipulation and memory allocation. Both 
OS/2 and UNIX, however, provide more complex 
process control as well as interprocess communica- 
tion functions. 

DOS is designed specifically for the Intel 8086 mi- 
croprocessor family and uses an .INT instruction to 
access low-level and operating-system functions. Pa- 
rameters are passed in registers and results are some- 
times returned in the flag register. DOS uses a linking 
loader that can resolve memory references within the 
program itself. DOS provides a conventional set of file, 
memory, and process functions. Low-level peripheral 
functions are accessed using BIOS interrupts. 

OS/2 and UNIX also use a linking loader, but access 
to operating system functions is done through calls 
instead of .INT instructions. All parameters are passed 
on the stack and results are returned in a register. 
This makes the operating systems more compatible 
with high-level languages. It also shows the influence 
of C, which was used to write a good deal of both 
operating systems. This also makes it easier to inter- 
face high-level language applications. The use of calls 
for all system functions has an additional advantage 
because there is no difference between an operating 
system function call and a call to a local function. This 
feature is how the operating systems can be transpar- 
ently extended to applications. The extensions are 
called dynamic link libraries in OS/2 and shared librar- 
ies in UNIX. In fact, most of the operating system 


features are implemented using these mechanisms. 

Both OS/2 and UNIX are designed to operate in a 
protected environment. This prevents an application 
from directly accessing peripheral functions using 
INT instructions or directly accessing peripheral ports. 
The protected environment means all support functions 
are provided by the operating system or its extensions. 

OS/2 and UNIX both provide file-access functions 
using handles. The file system is hierarchical in both 
cases but OS/2, like DOS, has multiple roots, one for 
each disk drive. Both operating systems provide ac- 
cess to devices using logical file names. 

Memory allocation functions with the ability to 
share memory between tasks are provided by both 
operating systems. However, tasks within OS/2 differ 
slightly from UNIX. OS/2 calls a process a logical entity 
that executes one or more threads that share re- 
sources, like files and memory, allocated to the proc- 
ess. A thread has an execution state and priority. A 
process is part of a session, which is a group of related 
processes. The idea is to keep the overhead associ- 
ated with threads low so they can be quickly created 
and terminated to make multitasking more suitable 
for general use. 

UNIX provides a similar multitasking environment. 
A UNIX process is more like an OS/2 process with a 
single thread. UNIX provides a fork function that splits 
a UNIX process into two separate execution units. The 
UNIX forked processes share the same code and data, 
but the overhead per item tends to be higher than 
with OS/2. Both operating systems provide a way to 
start up a completely new and independent program 
that can be under the control of the parent program 


...of thought. 


Introducing the world’s most productive programmer’s environment: 


The Prospero Workbench. 
hether you program in a program in sequence, without inter- than 20,000 programmers around 
Pascal or Fortran, you'll be ruption. So, you can never lose track of the world. Prospero also provides tech- 
more productive with the a complex variable in Fortran, or get nical support, just in case you do get 
Prospero Workbench. It’s side-tracked in Pascal. side-tracked. 
a fully integrated programmer’s environ- With pop down menus and four or Call 800-327-6730 to order or to 
ment for Fortran. And the world’s most eight editing windows, the Prospero request more information, and get your 
creative environment for Pascal. Workbench lets you know what’s programming on track with Prospero 
happening all the time. You can read PC Fortran and PC Pascal. 


at the press of a Key. 


The Prospero Workbench lets you how important that is. 


and edit files in different windows, and 
easily copy text from one file to another— 


PC FORTRAN ‘199 
PCPASCAL = 149 


PC Fortran and PC Pascal include 
the widely praised symbolic debugger 
PROBE. And Prospero fully implements 
ISO and ANSI standards. If you prepare 


programs on a microcomputer, and then 1-800-32 7-6 730 


run them on a mainframe, you know 


design, write, debug, test and document Prospero Software is used by more 
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Prospero Software 


LANGUAGES FOR MICROCOMPUTER PROFESSIONALS 


100 Commercial Street, Portland, Maine 04101 
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or independent of the parent. 

OS/2 and UNIX provide various interprocess com- 
munication facilities. Both provide unnamed pipes, 
that are sequential files that one process can read and 
another can write to. The OS/2 LAN Manager pro- 
vides named pipes which is a standard function pro- 
vided by UNIX. This makes UNIX a bit nicer to pro- 
gram in than OS/2. UNIX also has an extension being 
supported by AT&T called STREAMS, which is an 
extension on pipes that provides a number of features 
including message-oriented queues and remote proce- 
dure calls (RPC). The OS/2 LAN Manager can be used 
to build RPC support, but there is no standard similar 
to STREAMS. However, OS/2 does provide a queue 
mechanism unrelated to the file system that can pro- 
vide features similar to STREAMS. 

OS/2 and UNIX provide semaphores and asynchro- 
nous signals that are essentially identical. Shared 
memory can also be used for interprocess communi- 
cation. OS/2 inherently uses the 80286-segmented 
architecture for sharing memory. UNIX-shared mem- 
ory differs, depending upon the microprocessor on 
which it runs. 

So far, OS/2 and UNIX seem a lot alike. In fact, 
keeping to a high-level language such as C and the 
system functions already addressed will allow many 
applications to be easily ported between these two 
operating systems. The areas where the two operat- 
ing systems differ are in their handling of peripherals. 
OS/2 has the edge because of the limited number of 
different platform types. Essentially, each machine 
must be an 80286 or an 80386 running in the 80286 
compatibility mode, have a memory-mapped display, 
and have a keyboard capable of returning scan codes 
for each key movement. A mouse has a standard but 
optional interface. UNIX, on the other hand, runs on 
a multiplicity of processors with more options for the 
display and keyboard. This difference reveals itself in 
the peripheral control provided by the operating system. 

OS/2 has an extensive video display support for 
character-mode operations and access to graphic- 
mode displays but not at the same level as provided 
by the OS/2 Presentation Manager. Keyboard and 
mouse support has a similar functional complement. 
The main advantage of OS/2 is that a programmer 
can count on these functions and their associated 
support. UNIX systems usually come with libraries to 
support displays and input devices but applications 
often must be programmed to the lowest common 
denominator because you cannot count on having a 
memory-mapped display. The Curses library is one 
UNIX library that provides display and keyboard sup- 
port. It supports a wide variety of terminals and dis- 
plays by using a configuration file that describes the 
features supported by the terminal. An application 
does not have to worry about how functions are 
implemented. 

OS/2 and UNIX each have their share of unique 
functions. These range from handling a disk directly 
and manipulating the file system at a low level to 
handling peripherals such as printers and serial ports. 
Applications that deal with these functions tend to be 
very system-specific. 


Graphics Programming Interface 

Graphics tends to throw a monkey wrench into com- 
patibility. Windows from Microsoft and GEM from 
Digital Research are two graphics platforms for DOS. 
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They are also several user interfaces in addition to 
these two graphical programming environments. OS/ 
2 Version 1.1 will have an extended version of Win- 
dows called the OS/2 Presentation Manager. Like 
Windows, the Presentation Manager will be both a 
graphics interface for programs and a user interface. 
The complexity and number of functions in these 
graphics systems are actually greater than those pro- 
vided by the operating system. However, these envi- 
ronments provide a standard way of using graphics 
and integrating applications. In general, Windows is 
the dominant graphic environment for DOS, and Pres- 
entation Manager will dominate for OS/2. The visual 
similarities are striking. 

The UNIX user interface is both better and worse 
than OS/2. X-Windows is the graphics environment 
standard. There are others but X-Windows seems to 
be the dominant standard. Unfortunately, it is a graph- 
ics programming environment and not a user inter- 
face. There are a number of user interface contenders 
based upon X-Windows, including the recently an- 
nounced Open Look system. Open Look provides a 
way to define windows and dialog boxes like those 
found on the Apple Macintosh, Windows, and the 
Presentation Manager. 

The environments differ enough to make most 
applications environment-specific. It is possible to 
modularize programs to isolate the differences and 
use libraries to let a single program run under each 
of these graphic environments. However, the job is 
neither easy nor straightforward when it comes to 
implementation. 


Porting DOS Applications 

OS/2 and UNIX promise to provide platforms to which 
many companies will want to port existing DOS appli- 
cations. IBM recently displayed more than 300 OS/2 
applications ported from DOS at the Comdex show. 
Itis also a way to add multitasking features and access 
to multiuser environments that can enhance the func- 
tionality of many applications. Porting applications 
from DOS to either OS/2 or UNIX is possible and the 
ease with which it can be done depends on the applica- 
tion, how it is built, what facilities it uses, and what 
language it is written in. 

The language is the easiest to discuss. COBOL and 
FORTRAN tend to be the best for porting applications, 
not because they are great languages but because 
they have a standard base. Of course, using nonstan- 
dard features of these languages makes it just as hard 
to port an application written in another language. C 
is a favorite for porting applications because of the 
compatibility among compilers, but the ability to get 
close to the computer can also cause problems in 
porting applications. All bets are off using assembler, 
but going from DOS to OS/2 with an assembler appli- 
cation tends to be easier than going from DOS to UNIX 
unless the target version of UNIX is running on an 
80286 or possibly an 80386. However, problems tend 
to occur when dealing with memory segments and 
segment addressing. 

The facilities within the implementation language 
are one aspect of porting an application. System- 
specific facilities are another aspect. DOS-to-OS/2 
application migration tends to be easier in this area 
because the peripherals tend to be the same and so 
is the interface. UNIX usually falls short because the 
types of peripherals, like the terminals, tend to be 


Aucust 1988 


Don't fall flat on 


Sates Litvestans nomen Profit Poot 
1h dele TORPERRS VMAS IETF V OTE UCT TT T 


your Ashton. 


=H keevat Dighs Jeowseeoeaee 


Peceant No, 1126561 


prea 
Huse [Mational Computers Carp) 
125 


ie 
Si ie ori 


253-299-1998 seemssacsl 
0-1-1267 


Power up Q-PRO4™ Version 5.0. Get all the 
reliable, ‘bug-free’ features you need NOW! 


The Most Powerful Screen 
Handler You’ve Ever Seen. 


Q-PRO 4 offers all the features, 
the power, and the reliability you 
need. And it’s available now! 


@ What you see is what you get 
(WYSIWYG) no coding 

@ Fill-in-the-blanks screens 

@ Field editing with attributes 

@ Entry fields addressable as variables 
in the program 

@ Yes/no fields 

@ Multiple choice fields 

@ Entry fields symbolically addressable 

@ Programmable screen field types 

@ Picture data editing 

@ 16 color palette 

®@ Active field color 

@ Painted background 


Fully Featured Windows (optional) 

@ 100 windows per screen 

@ Multiple windows displayed 
simultaneously 

@ Frame and title 

@ Any size, any location 

@ Create WYSIWYG with included editor 

@ 3 Types: data entry, scrolling help, 
and data display 


No Limits 
Unlimited files open simultaneously 
(we have overcome the DOS limits). 


@ 2,000,000,000 bytes per file 

@ Unlimited record size 

@ Unlimited number of fields on screen 

@® Unlimited memory arrays 

@ Unlimited size of one and two 
dimensioned memory arrays 


Extraordinary Data Files 

@ Data dictionary 

@ Nine indexes per data file 

@ All indexes in one file 

@ Ascending and descending indexes 

@ B+ tree (CBTREE) state-of-the-art 
speed 

@ Also random, ASCII, and comma 
delimited 


Advanced Fourth Generation 

Language 

@ Event driven architecture 

@ Procedural, with over 120 commands 

@ Lightbar menus 

@ Global subroutines 

@ Global variables 

@ Directory and path management 

@ Two dimensional tables 

@ Variables: string, numeric, 
floating point, Boolean 

@ DOS shell 

@ On line debugger 

@ Julian dates 

®@ Sophisticated editor included 


Report Generator 
@ Relational, multifile WYSIWYG 


CIRCLE 82 ON READER SERVICE CARD 


Modern Transaction Processing 
® Transaction recovery and 
rollback 


Security 

@ Optimizer encrypts and speeds up 
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different compared to similar peripherals found on 
the standard DOS-based PC. Moving an application 
that requires a mouse or graphics display can be 
impossible if the version of UNIX does not support 
these facilities. 

Everyone is familiar with modular design and the 
reasons for it. It will be much easier to port a DOS 
application that isolates system-specific features into 
modules. Modularity will definitely make a difference 
when moving applications to either OS/2 or UNIX. 
How an application is partitioned is important, espe- 
cially for graphic applications. Although there are 
similarities, Windows, Presentation Manager, and X- 
Windows are significantly different in almost all as- 
pects from dialog boxes and menus to drawing func- 
tions and text support. 

The application itself can also be a factor in deter- 
mining the difficulty of porting from DOS to OS/2 or 
UNIX. Applications that require a great deal of proc- 
essing power may find the UNIX environment stifling 
because resources must be shared with other users. 
The throughput to the display terminal may also limit 
what applications will run under UNIX. Single-user 
UNIX workstations fair better than multiuser UNIX 
systems in these aspects, and the single-user version 
is on a par with OS/2 for porting DOS applications. 

It is difficult to make a sweeping statement about 
the suitability of OS/2 or UNIX as platforms when 
porting existing DOS applications. OS/2 tends to have 
a better match because the file system and file names 
are identical and the peripheral environment tends 
to match that found in DOS. Graphics-based applica- 
tions will have the same problems with both operating 
systems although a Windows application will probably 
port to the Presentation Manager much more easily 
than to X-Windows. 


Summary 
OS/2 and UNIX are the two dominant, multitasking 
operating system alternatives available for the PC. 
The former is designed as a single-user environment 
and could grow into a multiuser system while the 
latter is designed as a multiuser platform and per- 
forms well as a single-user environment. Both work 
well in a LAN environment. Although neither provide 
the solution for all problems or all users, there is a 
significant overlap in the solutions the two do provide. 
Porting DOS applications to either OS/2 or UNIX 
can be a major endeavor. Neither operating system 
offers a direct method for porting applications to their 
native environment, although some 80386 versions 
of UNIX do offer DOS compatibility modes similar to 
the DOS compatibility box found in OS/2. These tend 
to be interim solutions, however, and tend to have 
significant limitations which makes it preferable to 
port DOS applications to the native environment for 
integration purposes in addition to taking advantage 
of the features found in these operating systems. 
Porting applications written in high-level languages 
such as C, COBOL, and FORTRAN tend to be the easiest 
approach because of the compatibility between com- 
pilers and the similarity of system functions in DOS, 
OS/2, and UNIX. Problems will tend to occur with 
programs written in assembler and BASIC (due to lack 
of standards for BASIC compilers). Dependence on 
peripherals found on PC-compatible machines is also 
amajor source of problems. As always, a well-designed, 
modular program will be easier to port. oO 
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DATABASE QUERIES 


by P. L. Olympia, Ph.D. 


dBASE III TO dBASE IV: 


Should You 
Switch or Fight? 


nless you’ve been hibernating 
[ during the last few months 

you now know that dBASE IV 
is promised for release at the end of 
July. The amount of press it has been 
getting since the formal announce- 
ment last February 17 leaves little 
doubt that this is one product that 
will have a profound influence on the 
microcomputer market. At the same 
time, it is easy to become confused 
by the daily deluge of information 
from the media about product fea- 
tures and “misfeatures.” Normally, I 
do not discuss unreleased products, 
but the debate that already has sur- 
rounded dBASE IV merits clarification 
and comment. 

In this column, and in future col- 
umns, I hope to be able to provide 
some of the information you need to 
assess whether dBASE IV is for you, 
and what conversion problems, if any, 
you may have in switching over from 
dBASE III Plus. Rather than simply 
recite the list of anticipated changes 
or describe those changes superfi- 
cially, I prefer to focus on a select 
group of features in each issue. How- 
ever, bear in mind that dBASE IV is 
not even in beta test as I write this 
column (early May). Any features I 
discuss are always subject to change. 
By the same token, press reports, 
such as those alluding to the exis- 
tence of 3300 bugs (amazing how one 
can keep score) in an alpha version 
of the product, are meaningless and 
should not be taken seriously. That 
is not to say that dBASE IV will be 
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completely free of bugs when it is 
released. It may not be, but no one 
should be criticized for having pro- 
gram bugs in an alpha release. That, 
after all, is why we have beta tests. 


Overview 

dBASE IV is avery large and complex 
product. If you thought the changes 
from dBASE II to III were mind- 
boggling, you'll appreciate knowing 
that those pale in comparison to what 
dBASE IV offers beyond dBASE III Plus. 
Some of the noteworthy enhance- 
ments are: 


e A pseudo compiler that automati- 
cally compiles a program file (PRG) 
into object form (.DBO). Commands 
such as DO <prg> and SET PROCE- 
DURE TO <proc> will invoke the 
compiler automatically. 

e A new and more powerful, non- 

procedural interface called the Con- 

trol Center that will make you for- 
get the tortuous dBASE Assistant 

(if you haven’t already). Using pull- 

down menus, the Control Center 

gives you access to almost every- 
thing in dBASE IV, and without you 
having to write a single line of 

code. The help system is also im- 

proved. It allows you to branch to 

Rolodex-style help panels of related 

topics. You may invoke the print 

option of a help panel to get a hard 
copy of it. 

Support for Structured Query Lan- 

guage (SQL). SET SQL ON/OFF de- 

termines whether dBASE IV in- 
terprets queries in SQL or native 
mode. SQL queries cause the pro- 
gram to translate the query into 
dBASE language, compile it, then 
execute it. There are no current 
plans to provide access to the in- 
termediate dBASE code for use, 


perhaps, in learning how the trans- 
lation is done. 
Vastly improved multiuser capa- 
bilities. The program provides auto- 
matic record and file locking, auto- 
matic retry, and a facility to indi- 
cate which network user has a lock 
on arecord or file. 

Features such as multiple child re- 

lations, user-defined functions 

(UDF), and arrays for which most 

of us went to competing products. 

dBASE IV UDFs may be written only 
in the dBASE language and not 
traditional programming languages, 
such as C or assembler. Arrays 
may be one- or two-dimensional, 
and may have up to 1025 elements. 

(I will discuss UDFs and arrays in 

later columns.) 

e Ausefulforms manager and report 
writer. You may design data entry 
screens and reports simply by “paint- 
ing” the screen with data fields, 
computed fields, boxes, and text. 
dBASE IV translates those into 
.PRGs which, obviously, can be com- 
piled. 

e Agreatly enhanced dBASE language 

including new commands and func- 

tions. I will be discussing improve- 
ments on the dBASE language in 
more detail later. 


dBASE IV actually comes in two fla- 
vors: the end-user version and the 
developer version. The developer ver- 
sion includes everything offered in 
the end-user version, plus an interac- 
tive debugger, unlimited run time, a 
three-user LAN pack for program de- 
velopment, and additional technical 
reference manuals, including a de- 
scription of how dBASE handles pa- 
rameters to the CALL statement. The 
end-user version costs $795, and the 
developer version costs $1,295. 


Program Limits 

Remember when dBASE II always 
tested your ingenuity because its data 
files could only have 32 fields, and it 
allowed you to open only two data 
files at one time? You may still re- 
member the relief with which you 
greeted the arrival of dBASE III, which 
gave you the facility to have 128 fields 
per file and up to 10 data files open 
at one time. But, just as there is no 
such thing as too much speed or too 
much disk space in the world of com- 
puting, it didn’t take long to outgrow 
even those limits. Here, dBASE IV 
provides some relief, but not much. 
A dBASE IV data file may have up to 
255 fields. Supposedly, up to 99 files 
may be open at one time (compared 
to dBASE III's 15), but like dBASE III, 
only 10 of those may be data files. In 
essence, you still have only 10 work 


Aucust 1988 


areas. 

dBASE IV raises the limit on the 
number of user-defined memory vari- 
ables from 256 in dBASE III to 2048. 
Additionally, it maintains automatic 
printer system variables that are ac- 
cessible by any program. The dBASE 
IV text editor is also much improved; 
it is now able to handle lines up to 
1024 characters long—the same as 
the maximum length of a command 
line, up from 256 in dBASE III. 

The maximum number of proce- 
dures in a procedure file is up from 
32 to 1170. One of the problems with 
dBASE III Plus and its clones is their 
failure to support more than one pro- 
cedure file at a time. There is talk 
that dBASE IV may have multiple pro- 
cedure files open simultaneously, but 
even this may not be necessary. Any 
PRG file may contain a number of 
procedures within it. Once the file is 
compiled to object form, all proce- 
dures embedded within the .PRG file 
are accessible by other command files 
in the application. 


File Structure Changes 

dBASE IV data files have a new field 
type, F, to accommodate floating point 
numbers requiring a high degree of 
precision. The old numeric field type 
N remains fully supported and can 
be used for fixed point numbers to 
help make a comparison between nu- 
meric variables more predictable. In- 
ternally, dBASE IV stores F and N 
variables differently. 

When you create or modify a dBASE 
IV database structure, you may spec- 
ify up to 47 fields in the file to be index 
fields. This fact is recorded in an area 
of the .DBF file that is unused by 
dBASE III so this change preserves 
data file compatibility between the 
two products. When you identify a 
field as a key field, dBASE IV auto- 
matically creates a multiple index 
(MDX) file with the same name as 
the .DBF file. A .MDxX file may contain 
up to 47 index tags, each of which 
consists of a name and an index ex- 
pression. Thus, one .MDxX file serves 
the function of 47 old .NDX files. The 
latter are still supported although they 
should make the list of endangered 
species in no time. With an .MDX file, 
you need to have only one file open 
instead of 47, all tags are updated 
automatically, and the index expres- 
sion also supports descending keys. 
You may have more than one .MDX 
file open at one time, so in principle, 
there is no practical limit on the num- 
ber of indexes concurrently active. 

Lest you think that .MDX key ex- 
pressions are confined to individual 
fields in the file, remember that tags 
containing complex expressions may 
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be assigned using the INDEX com- 
mand. For example: 


INDE ON STR (acct, 4) +SUBS (categ, 4) 
TAG acctype DESC 


defines the tag acctype as a descend- 
ing key built from the two fields acct 
and categ. You may specify the con- 
trolling index in the USE command, 
for example: 


USE mydbf ORDER acct ype OF mymdx 


assigns tag acctype of mymdx.mdx to 
be the controlling index for the data 
file mydbf. dbf. 


dBASE IV raises 
the limit 

on the number 
of user-defined 
memory 
variables to 2048. 


dBASE IV can use and create the 
.NDX files of dBASE III. I am not able 
to tell any difference in the structure 
between .NDX files of the two ver- 
sions. Unless, you have a need to go 
back and forth between dBASE III 
Plus and dBASE IV, there is no advan- 
tage to using .NDX over .MDxX files. 
Naturally, you can’t have a descending 
clause in an INDEX command meant 
to create an .NDX file. 


Preparing for the Transition 

Will your dBASE III Plus programs 
and data files be usable under dBASE 
IV without change? For the most part, 
the answer is “yes.” Since dBASE IV 
automatically compiles programs to 
object form, my favorite technique 


of embedding comments inside IF .F- 
ENDIF blocks should be modified 
slightly to include a TEXT-ENDTEXT 
block, just as you would write the 
comments in Clipper. It is possible 
to have an application system run 
under the two dBASE versions as long 
as you avoid using dBASE IV enhance- 
ments. 

dBASE IV has no problem reading 
dBASE III index, report (.FRM), and 
label (.LBL) files, even though it cre- 
ates .PRG files in place of the latter 
two. Using .PRG files in place of .FRM 
and .LBL has the dual advantages of 
using fewer file handles and being 
able to compile the resulting code. 
dBASE IV also reads dBASE II] memo 
field files (DBT) without difficulty. 

Given that dBASE III Plus applica- 
tions may be ported into dBASE IV 
with a minimum of problems, any 
concerns you may have about switch- 
ing over when the time comes must 
be for reasons other than compatibil- 
ity. As you may have guessed, the 
power that dBASE IV gives you comes 
at the price of requiring more hard- 
ware resources. While dBASE III Plus 
could be run on a machine with dual 
floppies and 256K of RAM, dBASE IV 
requires 640K of RAM, and a disk 
space of at least two megabytes. 

What about retraining? dBASE III 
Plus programmers may require some 
training on the new features and ca- 
pabilities of dBASE IV to gain maxi- 
mum benefit from the software. Any 
experienced dBASE user should be 
able to learn the new features by self- 
study and experimentation. 

I expect more inexperienced users 
to develop custom applications with 
dBASE IV because of the ease and 
power of the Control Center, the forms 
manager, the application generator, 
and the report writer. Next time, I'll 
discuss report writing and memo fields 
support in dBASE IV. oO 
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by A.G.W. Cameron 


Transputers 


THE SCIENTIFIC COMPUTER USER 


and the Sun 386i 


ous about the transputers, and 

particularly about the trans- 
puter boards from MicroWay that have 
been widely advertised. A transputer 
is a CPU chip made by Inmos, a Brit- 
ish firm, which is supposed to be very 
fast both for integer operations (the 
T414 chip) and for floating point opera- 
tions (the T800 chip, a T414 with 
floating point circuitry added). Single 
T800s have been advertised to deliver 
1.5 mflops (millions of floating point 
operations per second). However, the 
feature for which transputers are most 
noted is that they can be hooked up 
to operate in a parallel network, thus 
giving extremely fast throughput for 
certain kinds of problems. 

Therefore, it was with a consider- 
able degree of interest that I attended 
a single-day transputer seminar hosted 
by MicroWay. It turned out that the 
advertised 1.5 mflops is quite mis- 
leading; that value is for operations 
on 3 x 3 matrices that can be installed 
on internal memory in the transputer 
chip. A somewhat more realistic prob- 
lem such as the LINPACK, which in- 
volves matrices of at least 100 x 100, 
gives only 0.36 mflops as reported to 
me by one of the attendees who had 
made the measurement. This is about 
twice as fast as a microVAX II. Several 
other people mentioned other prob- 
lems that also ran about twice as fast 
on a single transputer (a Monoputer) 
as on a microVAX II. 

This performance would have been 
considered extremely good in 1986, 
and quite good in 1987, but in 1988 it 
is so-so. Part of the reason is that I 
am beginning to detect a widespread 
sentiment that “if it’s DEC, it’s slow!” 
However, since MicroWay will sell 
you a 20-MHz Monoputer with 2 mega- 
bytes of RAM that inserts into an XT 
or AT bus for $1,995, this is still a 
very good value for the money. For 
the same price they will sell you a 
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20-MHz Weitek 1167 daughter board 
that can be used with certain 386 
machines to give about four times the 
speed of a microVAX (see my column, 
“Number Crunching Coprocessor 
Boards,” Micro/Systems, February 
1988). Thus, you have little to gain 
unless you can use a network of 
transputers. 

Asingle T414 chip contains several 
distinct components. The main com- 
ponent is a 32-bit CPU. This talks to 
a bus on which there is a block of 
circuitry providing system services. 
There are 4 kilobytes of on-chip RAM, 
there isamemory interface, and there 
are 4 link interfaces. The T800 then 
adds a 64-bit floating point unit to the 
chip. This follows the IEEE-754 stan- 
dard for floating point arithmetic ex- 
cept that it does not support transcen- 
dentals or trigonometric operations. 

The link interfaces provide a means 
of connecting transputers together 
into anetwork. MicroWay sells a board 
called a Quadputer that contains four 
transputers that can be connected to- 
gether in several possible ways. The 
transputers may be either T414s or 
T800s in any desired mix. One of the 
transputers talks to the PC mother- 
board and it and the rest of the 
transputers talk to one another. The 
Quadputer board may be only par- 
tially populated, leading to Biputers 
and Triputers. The Quadputer may 
typically be set up with 1 or 4 mega- 
bytes of RAM per transputer. If you 
want more memory than the 2 mega- 
bytes provided with a Monoputer, I 
suspect that MicroWay will be willing 
to sell you a Quadputer with only one 
transputer on it. MicroWay is also 
planning to offer more flexible mem- 
ory options in the future. To get a 
rough idea of prices, assume it will 
cost $1,500 for each T414 with 1 mega- 
byte of RAM, $2,000 for each similar 
module where you substitute a T800 
for a T414, and $1,400 for each up- 
grade of 1 megabyte to 4 megabytes. 

Parallelism in computing can mean 
a lot of different things. The lowest 
level of parallelism is a vector proces- 
sor in which successive steps of an 


instruction set are overlapped for a 
stream of incoming data. In a tightly 
coupled computer architecture, there 
may be many CPUs executing the 
same instruction set with multiple in- 
put data streams. A transputer net- 
work isa loosely coupled architecture 
with each CPU having its own task 
(and accompanying instruction set) 
and working with its own set of data. 
As a consequence, in a transputer 
network,communicationbetweentrans- 
puters takes place only when both 
sides of the communication link are 
ready for it. In practice, what this 
means is that the transmitting 
transputer sends a byte of data and 
waits for acknowledgment. When the 
receiver is ready to receive, it takes 
the first byte out of its buffer, sends 
the acknowledgment, and then the 
communication proceeds. The trans- 
mitter need not be suspended while 
waiting because it may have anumber 
of parallel tasks to perform, of which 
the transmission is only one. 

Acommon linkage arrangement for 
a Quadputer is what is called a trans- 
puter farm, in which one transputer 
talks to the external bus and runs a 
main control program. This transputer 
delegates tasks to the other three 
transputers, called “workers.” 

In the future, it is planned to in- 
crease the transputer operating speed 
to 30 MHz, which will result in a 50 
percent increase in throughput, un- 
less that throughput is limited by the 
speed of the external bus. 


Transputer Software 

Aspecial language called OCCAM was 
developed for programming trans- 
puters at the same time that the trans- 
puter was designed. In order to achieve 
greatest efficiency at present, pro- 
grams must by written in OCCAM. 
Programs written in conventional high- 
level languages, such as C, FORTRAN, 
or Pascal, can have their tasks sent 
out to other transputers for execution 
if special versions of those languages 
are used to compile components of a 
problem, and if those components 
are linked together by a framework 
of channels written in OCCAM. This 
framework is then referred to as a 
“harness.” 

At the lowest level in OCCAM there 
are three types of process. An assign- 
ment sets a variable equal to the value 
of an expression: v : = e. An input 
receives a value from a channel c and 
assigns it to a variable x; the notation 
is c ? x. An output places the value of 
an expression é in the channel c, with 
notation c ! e. 

These processes can be combined 
to form a construct, which is itself a 
process, and which in turn can be a 
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component of other constructs. Ex- 
amples of a few kinds of constructs 
are as follows: A sequential construct 
consists of a header SEQ followed by 
a list of component processes that are 
to be executed in the order indicated. 
A parallel construct consists of a 
header PAR followed by a list of proc- 
esses that are to be executed together 
(in any order), and which are called 
concurrent processes. Communica- 
tion, consisting at the lowest level of 
a combination of an input and an out- 
put process, is itself a construct. A 
conditional construct, consisting of a 
set of processes, one of which will 
be executed when a condition be- 
comes true, is written as a header JF 
followed by pairs of condition state- 
ments and the action to be taken. The 
alternative construct ALT is similar, 
but in this case a list of alternative 
input conditions is associated with 
actions to be taken, only one of which 
is performed, depending upon which 
input first becomes ready. There are 
loop (WHILE) and selection (CASE) 
constructs. The equivalent of a DO 
loop, known as replication, may be 
written as SEQ i = 0 FOR n or as PAR 
i = 0 FOR n. The language goes on to 
define types, declarations, procedures, 
functions, expressions, timers, periph- 
eral access mechanisms, and configura- 
tion mechanisms, including a two- 
level priority declaration. 

It may be seen that OCCAM differs 
from aconventional programming lan- 
guage primarily in its treatment of 
concurrent operations, including sig- 
naling. Each process has its own work- 
space and linkage area allocated in 
memory. The local workspace con- 
tains local variables and arrays. The 
linkage information contains pointers 
that the transputer uses to do house- 
keeping on multitasking and I/O pro- 
tocols. Processes run to completion 
unless they are de-scheduled or sus- 
pended. De-scheduling occurs if the 
process is waiting for input, output, 
or a timer event. A low-priority proc- 
ess is suspended if a high-priority 
process becomes ready to run. 

Various efforts are under way to 
write a transputer operating system. 
At present, OCCAM simply executes 
a single program unless several inde- 
pendent tasks are written together 
into one program, which would be 
awkward. Two versions of “Parallel 
C” are in late stages of development. 
Promised for the future are “Parallel 
FORTRAN” and “Parallel Pascal.” 


The Sun 386i Workstation 

On the same day as MicroWay’s 
transputer seminar, Sun Microsystems 
introduced its new 386i workstations. 
I went into Boston the following day 
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to see them demonstrated and I was 
very impressed. 

These workstations compete with 
the top-of-the-line 386 computers from 
IBM, Compaq, and AT&T. They come 
in two flavors: The 386i1/150 uses 20- 
MHz 386 and 387 chips, and it is 
optional whether you purchase a 32- 
kilobyte cache memory with the sys- 
tem. The 386i/250 is the top of the 
line, using 25-MHz 386 and 387 chips 
and with the cache as standard equip- 
ment. The introduction of the 250 
coincided with Intel’s announcement 
of the availability of the 25-MHz chips. 
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There are 4 megabytes of RAM stan- 
dard with the 150, and 8 megabytes 
of RAM standard with the 250, and 
both may be expanded to 16 mega- 
bytes. 

However, the most distinctive as- 
pects of the 386i are in the realm of 
software. It is a UNIX machine in which 
Sun has melded the AT&T System V.3 
and the 4.3/4.2 BSD versions of UNIX, 
and support for DOS 3.3 has been 
intimately integrated into the operat- 
ing system. Several DOS sessions may 
be run simultaneously in windows; 
Sun calls this its DOS Windows facil- 
ity. One limitation that seems unnec- 
essarily restrictive to me is that the 
amount of extended memory that can 
be associated with a DOS Window is 
only 2 megabytes. Sun does not in- 
tend to support OS/2 at present, since 
both UNIX and the DOS Windows sys- 
tem are inherently multitasking. 

Sun is intending to storm the cor- 
porate PC market with these ma- 
chines. This means that, in order to 
be credible and also to be user- 
friendly, it has had to write a com- 
pletely new UNIX front end, called the 
Sun Organizer. This provides the use 
of screen icons and a mouse, and 
thus it resembles the user environ- 
ment of the Macintosh (or perhaps, 
in these litigious times, one should 
say that both resemble the user inter- 
face designed at the Xerox Palo Alto 
Research Center). Thus the normal, 


bare-bones UNIX interface is well hid- 
den from the user, although Sun says 
that confirmed UNIX users will be 
able to use this interface if they want 
to. Among other things, the Sun Or- 
ganizer presents the user with a dia- 
gram of the directory tree structure 
and makes it easy to navigate through 
that structure. There is a “Help” key 
that gives a help message about the 
operation being attempted, or about 
subtopics of that operation; this uses 
a hypertext facility. 

UNIX is, of course, a huge operat- 
ing system, and so a big disk is needed 
with the 386i. That big disk may be 
on an Ethernet network server; Sun 
emphasizes the use of the network 
for connecting groups of these work- 
stations for a given operation within 
a corporation. But 91- and 327- 
megabyte disks are available for use 
directly with the workstation. There 
are SCSI interfaces for these disks; 
the 91-megabyte disk has an 18 milli- 
second seek time and the 327- 
megabyte disk has an even shorter 
seek time at 16.5 milliseconds. Using 
a small expansion unit, up to three 
327 megabyte disks can be installed 
together with a 60-megabyte tape car- 
tridge drive. 

Within a few months the 386i will 
also be of considerable interest to the 
scientific and technical market. A FOR- 
TRAN compiler (to run under UNIX) 
is currently in beta test. The machines 
currently carry a 387 coprocessor, 
but this plugs into a Weitek socket, 
and the Weitek 1167 coprocessor 
board is expected to be available to 
plug into this socket about in Septem- 
ber (this is currently waiting for the 
availability of 25-MHz Weitek chips). 
With a 386i thus enhanced, it should 
run floating point problems faster than 
any other existing Sun workstation, 
which will be of particular interest to 
scientists and engineers. In Septem- 
ber, Sun is expected to announce a 
much faster SPARC-based workstation, 
but it will be much more expensive 
than the 386i. ao 
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by Robert Blacher 


IN THE PUBLIC DOMAIN 


The Shareware Alternative 


out of the way first. The title 

of this column notwithstand- 
ing, there is almost no “public do- 
main” software available for down- 
loading from DOS bulletin board sys- 
tems (BBS). A program is in the pub- 
lic domain only if the author claims 
no rights to the program. That’s a 
mistake. If a program is public do- 
main, someone else can actually sell 
the program without violating the 
author’s rights. 

Most software on BBS either is, or 
should be, copyrighted. What we 
loosely refer to as public domain re- 
ally is copyrighted software that the 
author is distributing for free. The 
author hasn’t abandoned all legal 
rights to the software; you’re just not 
being charged for the license to use it. 

The other major category of soft- 
ware on BBS systems is variously re- 
ferred to as shareware, freeware (so 
much for the English language— 
freeware isn’t free), or user-supported 
software. Again, the software is copy- 
righted, but this time you are not 
being given the right to use the soft- 
ware for free. The author is providing 
you with an opportunity to try the 
software and see ifit meets your needs. 
If you find yourself using the pro- 
gram regularly, you are requested to 
send in a license fee. 

When I first started running an 
RCP/M over four years ago, almost 
all of the software was free. (Actually, 
I can’t recall any CP/M programs for 
which the author requested money.) 
The same still holds true for UNIX 
software. Almost all software that is 
distributed over USENET, the anar- 
chic “uucp” network that distributes 
mail and programs to thousands of 
UNIX sites worldwide, is free. There 
has been hot and heavy discussion 
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on the net from time to time about 
whether shareware should even be 
allowed. 

I don’t have any problem with share- 
ware. Many authors of shareware pro- 
grams spend a great deal of time 
creating, testing, and supporting their 
programs. It seems only fair to ask 
users of those programs to contrib- 
ute financially. But, a lot of authors 
have unrealistic expectations of the 
returns from the shareware “market” 
and are disappointed when they re- 
ceive few registrations. That’s lead- 
ing to yet another category of soft- 
ware on BBS systems called “cripple- 
ware”—programs where the BBS ver- 
sion is intentionally limited in some 
way to encourage users to register 
the program and receive a full operat- 
ing version. 

I do have a problem with cripple- 
ware. But, rather than going off on a 
tirade about that subject, I want to 
talk about reasonable expectations 
with regard to BBS-distributed soft- 
ware for both users and authors. Crip- 
pleware stems from unrealistic no- 
tions about getting rich from share- 
ware programs. Forget it. A little 
thought about what’s really going on 
out in the BBS world may help clear 
up those misconceptions and save 
us all a lot of grief. 


Why Shareware? 

Let’s look at it first from the caller’s 
perspective. There are three major 
categories of PC applications: (1) word 
processors; (2) spreadsheets; and (3) 
database managers. Most of us bought 
our PCs precisely because we wanted 
to use them for one or more of these 
tasks. Is it realistic to think that some- 
one who uses his PC primarily for 
word processing is going to forsake 
WordPerfect or Microsoft Word in 
favor of a program downloaded from 
a BBS? Will he give up Lotus 1-2-3 for 
a shareware spreadsheet? Will he aban- 
don dBASE for a file manager found 
on a BBS? C’mon now! 

I’m not saying there aren’t good 
programs in each of these categories 
available on BBS systems. There most 
certainly are: WYWord and Galaxy 


are both good word processors; PC- 
Calc Plus, QubeCalc, and Insta- 
Calc are fine spreadsheets; and File 
Expressand PC-File Plus are com- 
petent, easy-to-use file managers. But 
look at the competition. It costs about 
$200 to pick up a copy of WordPerfect 
by mail order these days. I recently 
bought Borland’s Quattro for $99. Re- 
flex goes for about the same, and 
Symantec’s Q & A, which combines 
simple word processing with power- 
ful database capabilities, is advertised 
by mail order for $209. Sure, dBASE 
and the like go for a lot more, but 
there isn’t a full-fledged, relational 
database manager with a procedural 
language available on BBS systems. 

In short, the competition from com- 
mercial programs in these three cate- 
gories is very stiff indeed, and some 
of the programs aren’t selling for much 
more than their shareware counter- 
parts. That doesn’t mean there’s no 
hope for shareware in these areas. 
Someone who uses a computer pri- 
marily for word processing may find 
that File Express is exactly what he 
needs in the database area. Hope- 
fully, that person, and others, will 
register the program if he continues 
to use it and bring the Expressware 
folks their just returns. But, I doubt 
this is where the future of shareware 
really lies. 

Shareware excels at doing things 
the commercial software houses can’t 
do. The game isn’t to compete with 
Microsoft, Lotus, and Ashton-Tate. 
Rather, it’s to figure out what needs 
those companies can’t serve and tar- 
get those areas. The area of DOS 
utilities is a good place to start, and 
Vern Buerg’s indispensable LIST pro- 
gram is a perfect example. 


LIST 

LIST is a file viewer, and a whole lot 
more. It’s perfectly silly, but DOS fails 
to provide a convenient way to look 
at text files that are larger than one 
screen. If you TYPE such a file, you’d 
better have fast fingers or it will go 
scrolling off your monitor. DOS does 
provide the MORE program, but it’s 
pretty clumsy stuff. Ifyou enter MORE 
< TEXT-FIL, you'll get the file one 
page at a time. But look out. Enter 
MORE > TEXT.FIL and you’ve just 
erased it! Is this any way to run an 
operating system? 

You could load your favorite word 
processor to read text files, but that’s a 
case of gross overkill. LIST offers the 
ideal solution. Into a tiny 8K .COM 
file, Buerg has packed an incredible 
array of capabilities. Of course, it al- 
lows you to page through a file, using 
the Home, End, PgUp, PgDn, and 
arrow keys that are intuitive and 
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handy. That’s just the beginning. Experi- 
enced users of LIST know you can use 
it to search through a file, mark a 
block of text and dump it to another 
file, print all or part of the file, put a 
ruler on the screen to help you count 
column positions, and even examine 
binary files in hex mode. There’s much 
more to be found in this wonderful 
program. 

As the above makes clear, I think 
LIST is invaluable. But in monetary 
terms, how much is it worth? Buerg 
requests a contribution of $15. I hap- 
pen to think that’s a steal and suspect 
the program would readily sell for 
more, but there’s an upper limit for 
a “small” program of this kind. $25? 
$35? $50? I’m not exactly sure, but it’s 
somewhere in that range. 

Look around at some of the soft- 
ware ads. How many $15 programs 
do you see? Precious few, if any. It’s 
just not practical for a major software 
house to market a program at that 
price. With the costs of development, 
testing, marketing, advertising, and 
distribution, it’s impossible for a Mi- 
crosoft to sell a “small” program, no 
matter how badly needed that pro- 
gram is. Right there, you begin to see 
at least one market niche for share- 
ware. There are whole categories of 
indispensable, “small” programs that 
just can’t be marketed commercially. 
Shareware provides a perfect, alter- 
native method of distribution. 


What’s Hot on BBS? 

In general, you'll find utility programs 
on BBS systems that simply have no 
parallel in the commercial market. 
Here’s a quick look at a few more 
that you'll find on any good BBS: 


e PCOPY: Created by Norm Patri- 
quin, this excellent program has 
all the capabilities of DOS’s XCOPY 
and REPLACE programs and much 
more. You can move files, as well 
as copy them, and the selection 
criteria are very extensive. 


© OVERVIEW isan excellent, sweep- 
type file manager that allows you 
to view all files in a directory and 
tag them for copying, erasure, etc. 
There are commercial programs 
in this area, such as XTREE, but 
the BBS world really has those beat. 


¢ FGREP Written by Chris Dunford, 
this is a text search utility. DOS’s 
FIND command is a pale counter- 
part to this program in terms of 
both capabilities and speed. And 
FGREP is free! 


© SORTF: Also created by Vern 
Buerg, the DOSSORT program just 
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can’t hold a candle to this one. 
DOS’s program drops dead on files 
larger than 64K. Buerg’s program 
will sort much larger files and do 
so faster and with more options. 
SORTF is also free. 


¢ WSSINDEX: This shareware pro- 
gram keeps a catalog of what’s on 


your floppy disks. WSSINDEX al-- 


lows you to add comments about 
each file, catalog ARChive directo- 
ries, and sort and print the catalog 
in a variety of ways. 


e MARKand RELEASE: These excel- 
lent programs by Kim Kokkonen 
give you control over all those TSRs 
you're loading these days. With 
this package, you can remove some 
or all resident programs from mem- 
ory when you need to free up RAM 
for large applications. MARK and 
RELEASE are free, and source code 
is available. 


People who haven’t looked at the soft- 
ware available on BBS systems are 
missing out on programs like these. 
It’s not just a question of saving money. 
There’s software available on BBS sys- 
tems that simply has no commercial 


counterpart because it’s economically 
impractical for companies to market 
low-cost programs of this kind. 

If you’re an author of a program 
and are considering distribution via 
BBS systems, let me urge you to go 
into this venture with your eyes wide 
open. Think about the programs that 
are already available, both in the com- 
mercial and the shareware arenas. 
Then, think hard about why you’re 
doing this. If your primary motivation 
is financial, you’re likely to be disap- 
pointed. Regardless of efforts to beg, 
bribe, or otherwise coerce shareware 
program users, most users won't con- 
tribute. Ask a public television station 
how many of its viewers pay the yearly 
membership fee. Shareware is no dif- 
ferent, and most users will take the 
“free ride.” But, if your program truly 
meets a need and you get it into the 
hands of enough people, you can hope 
for some fair reward—if not wealth, 
at least some congratulatory com- 
ments in messages on BBS systems 
and a favorable mention in future edi- 
tions of this column. a 
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CP/M diskettes on your PC or AT. 


UniForm-PC 
Read, write, copy, and format diskettes for over 275 CP/M and MS-DOS computers on 
your PC or AT, including 3¥2", 96 TPI, high density, and 8” formats. Use your DOS 
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Run CP/M programs in your PC or AT with this 8 Mhz. Z80H half-card. UniDOS stays 
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New Products 


New Software 
Products 


Novell Branches IntoAp- 
ple’s Network Market 
Novell, Inc., recently un- 
veiled the latest version of 
its network operating sys- 
tem, NetWare Version 2.15, 
which will support Apple 
technology as well as DOS 
and OS/2. NetWare v2.15 
is AppleShare compatible 
and allows users in an Ap- 
pleTalk Network System to 
become fully NetWare com- 
patible. Each workstation 
uses its native operating sys- 
tem, so users of Apple, PC, 
and PS/2 workstations 
each operate in their own 
environment while sharing 
files and data across the 
network without conver- 
sion. NetWare for the Macin- 
tosh may be installed in 
either the NetWare file 
server or in an external Net- 
Ware bridge workstation, 
and provides access to Ap- 
ple LaserWriter printers to 
all network users. NetWare 
v2.15 also offers Mac users 
access to Novell’s System 
Fault Tolerance (SFT) and 
security. NetWare also sup- 
ports AppleTalk protocols 
inside the server, and pro- 
vides full support of Ap- 
pleShare security and pri- 
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Novell’s Netware, Version 2.15 supports DOS, Apple, and O: 


vacy controls, and provides 
NetWare security and ac- 
counting functions to Ap- 
pleShare. The initial release 
includes support for Local- 
Talk and EtherTalk adapt- 
ers for the Macintosh, and 
future compatibility is 
slated for Token-ring, ARC- 
net, and other adapters. 

NetWare Version 2.15 is 
scheduled for distribution 
in the fourth quarter in the 
SFT ($4,695), and Advanced 
($2,695) versions. Entry 
Level System (ELS) Level 
II will be available early in 
1989 for $1,395. For exist- 
ing NetWare v2.1 andv.2.11 
sites, NetWare v2.15 will 
be available as an upgrade 
for $200 per file server. For 
more information, contact 
Novell, Inc., 122 East 1700 
South, Provo, UT 84601; 
(801) 379-5900, or contact 
your Novell authorized re- 
seller. 

Circle number 15 on the 
reader service card 


Toolkit Hammers DOS 
Into Shape for UNIX 
The MKS Toolkit Version 
2.3 adds more than 115 
UNIX System V-compatible 
commands to DOS on the 
PS/2, PC, and compatibles. 
Included in MKS Toolkit is 
a parser-generator lan- 
guage, YACC (Yet Another 


NefWare 


S/2 


Compiler Compiler), that 
takes context-free gram- 
mar, converts it into a set 
of tables, and generates a 
program that will recognize 
that language. MKS YACC 
is fully compatible with 
UNIX YACC. Also included 
is a spell function, a diff 
command to compare three 
versions of a text file, and 
data compression. 

The MKS Toolkit lists for 
$169 and is available to run 
under DOS 2.0 an later. For 
more information, contact 
Mortice Kern Systems 
Inc., 35 King Street North, 
Waterloo, Ontario, N2J 
2W9; (519) 884-2251. 
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New Hardware 
Products 


Laptop Analyzer Can 
Smell Trouble on LANs 
Network General Corpora- 
tion has introduced a lap- 
top version of its Sniffer, 
Model PA-302. Housed in 
a Toshiba T3200 with a 12- 
MHz 286 processor, the Lap- 
top Sniffer is a portable pro- 
tocol analyzer and diagnos- 
tic tool for Ethernet-based 
networks. The Laptop Snif- 
fer operates with all Eth- 
ernet protocols, including 
TCP/IP, DECnet, XNS, NFS, 
ISO, APPC, NETBIOS, and 
NetWare. Network General 
is also expanding its Snif- 
fer series to support Token- 
ring, ARCnet, and StarLAN 
technologies. The self- 
contained unit can capture 
network data transmissions 
and break them down to 


isolate problems that de- 
grade network perform- 
ance. The Laptop Sniffer 
includes Version 1.3 of the 
LAN troubleshooting soft- 
ware that offers graphical 
display of network activity, 
frame size control, an im- 
proved frame capture rate, 
traffic generator capabili- 
ties, and detailed protocol 
analysis. The new software 
version also includes Tele- 
Sniffer, which enables the 
laptop computer to conduct 
LAN diagnosis via tele- 
phone lines. 

The Laptop Sniffer is 
priced at $15,000. For more 
information, contact Net- 
work General Corpora- 
tion, 1945A Charleston 
Road, Mountain View, CA 
94043; (415) 965-9608. 
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WAN Controllers Con- 
form to OSI Standard 
Retix has unveiled the PC 
300 series of Wide Area 
Network Controllers to con- 
nect PCs, XTs, ATs, and 
PS/2 units to OSI WANs 
through leased lines. This 
series of WAN controllers 
supports Open Systems In- 
terconnection (OSI) and 
both the connection-ori- 
ented (CONS) and connec- 
tionless (CLNS) network 
services. The PC-320 has 
512K of RAM and an on- 
board, low-speed RS-232 
port. Other versions can be 
configured with up to 1.5 
MB of RAM, and a high- 
speed port (up to 64 kbps) 
is optional. The PC-321 sup- 
ports RS-449, RS-530, or 
X.24 connections. The PC- 
322 supports RS-232 or V.35 
connections. Software pro- 
vided with the controllers 
includes the SW-330 pack- 
age with ISO Transport 
Class OS/2 and 4, the ISO 
Internet Protocol IP to x.25, 
Subnetwork Dependent 
Convergence Protocol 
(SNDCP), x.25 Packet- 
Level Protocol, and HDLC. 

The PC-300 is scheduled 
for shipment in the fourth 
quarter. For more informa- 
tion, contact Retix, 2644 
30th Street, Santa Monica, 
CA 90405; (213) 399-2200. 
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X-BANDIT 


Designed for the EMS 4.0 Standard 


DESIGN PHILOSOPHY 


e The Teletek X-Bandit was specifically designed to utilize 
the advanced features of the Lotus/Intel/Microsoft EMS 4.0 
Specification. It is available in both 8 and 16 bit versions 
for use in the IBM XT, AT, and compatibles. 


MEMORY 


e Segmented Memory Mapping allows the user to fill out 
unused memory segments between 640K and 1 Megabyte. 


e Split Memory Addressing allows the user to fill out con- 
ventional memory to 640K. 


e Extended Memory Addressing is available for the PC/AT 
version. 


¢ 2 MB capacity in a single slot. Up to 8 MB per system. 
e Parity checking. 


SOFTWARE 
¢ Easy menu-driven auto configuration software. 


e Device driver includes print spooler and RAM drive. 


e Supports multitasking with the appropriate shell-resident 
software package. 


SPEED 


e 6/8/10/12 MHz speed with 0 wait states. 16 MHz speed 
with 1 wait state. 


WARRANTY 
e One year parts and labor. 


Sacramento, CA 95838 
™~ (916) 920-4600 
Telex #499-1834 
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DOS system running Lotus 1-2-3 


THIS IS AN IBM 
PS/2 MODEL 80 
RUNNING DOS 


Under DOS, this PS/2™ is a powerful 80386-based single- 
tasking, single-user computer that can run thousands of 
DOS applications. In 16-bit, 8086 mode. 

One at a time. 

When OS/2™ software becomes available, the PS/2 can 
become a multitasking, single-user computer running in 
16-bit, 286 mode that can also single-task those DOS 
applications under OS/2. 


One at a time. 
With DOS or OS/2, the PS/2 will support one user. 


1 user (DOS) 1 user (OS/2) 
Cost per system**: $12,389 $12,594 
Cost per user: $12,389 $12,594 


*SCO VP/ix available as sepa: 
** Cost comparisons are b 


ASE HII PLUS.® | -user OS/2 system: |-user [ 
rkalike ). 9-user SCO XENIX system: | -user SCO XENIX system 
onal RAM, additional 70Mb disk. 


Mode! 80, 70Mb disk, 1Mb RAM, IBM 8512 color 1 
DOS. L-user SCO XENIX system: Base machine, plus SCO XENIX 386 
gent 8-user multiport card, 8 IBM 315! ASCII terminals. 33-user SCO XENIX sy: 


tly published lS. domestic suggested list prices. Cost model: Base machine: IBM PS/2 


al System/2, PS/2 and OS/2 are trademarks of International Business Machines Corporation. * Lot 
© VP/ix is a trademark of INTERACTIVE Systems, Inc. * Lyrix is a registered tradem 
PO, Box 1900, Santa Cruz, CA 95061 The Sant 
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SCO XENIX system running SCO Professional 


THIS IS AN IBM 
PS/2 MODEL 80 
RUNNING SCO XENIX 


Under SCO XENIX,” this PS/2 becomes a powerful 80386- 
based multitasking, multiuser computer that can run thou- 
sands of XENIX applications. In full-tilt, 32-bit, 386 mode. 
Many at a time. . 

And using SCO VP/ix;* the PS/2 can multitask DOS 
applications under SCO XENIX. 

Many at a time. 

With SCO XENIX, the PS/2 will support one user. 

Or 9 users. Or even 33 users. 

And it can do all that today because you can get SCO 
XENIX for the PS/2— now! 


1 user 9 users 33 users 


Cost per system**: $14,559 $19,726 $40,402 
Cost per user: $14,559 $2,192 $1,224 


SCO XENIX System V and the SCO XENIX family of software solutions 
are available for all industry-standard 8086-, 80286-, and 80386-based 
computers, and the IBM® Personal System/2™ Models 50, 60, and 80. 


SS" (800) 626-UNIX (626-8649) 
3 (408) 425-7222 
FAX: (408) 458-4227 


= 7 TWX: 910-598-4510 sco sacz 
THE SANTA CRUZ OPERATION uucp: ...decvax! microsoft! sco! info! 


ater XL® L-user DOS system: Ba 
sing). SCO FoxBASE+™ (dBASE II! PLL 


intelligent 8-user multiport cards, 24 mi 


IMb additional IBM RAM, IBM Pre 
2, SCO VP/ ix, SCO Lyrix® (word pr 
user SCO XENIX System, plus 3 mc 


lemarks of Lotus Development Corporation. * dBASE iil PLUS is a registered trademark of Ashton-Tate, 
of The Santa Cruz Operation, Inc. * Gt FoxBASE+ is a trademark of Fox Software, Inc. 10/8 
don WIA 4YN United Kingdom, +44 1 439 2911, (FAX): +44 1 637 9381, TELEX: 917372 soo tas 


